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HUMAN OXIDOREDUCTASE PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of oxidoreductase proteins and 
5 to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative disorders 
including cancer; endocrine, metabolic, reproductive, neurological, viral, and autoimmune/inflammatory 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of oxidoreductase proteins. 

10 BACKGROUND OF THE INVENTION 

Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) activity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleotide 
(FAD), and the nicotinamide adenine dinucleotides NAD and NADP (Newsholme, E. A. and Leech, 
15 A.R. (1983) Biochemistry for the Medical Sciences , John Wiley and Sons, Chichester, U.K. pp. 779- 
793). 

Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) with 
concurrent oxidation of the cofactor. The reverse dehydrogenase reaction catalyzes the reduction of a 
cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad superf amily 

20 of proteins that catalyze numerous reactions in all cells of organisms ranging from bacteria to plants to 
humans. These reactions include metabolism of sugar, certain detoxification reactions in the liver, and 
the synthesis or degradation of fatty acids, amino acids, glucocorticoids, estrogens, androgens, and 
prostaglandins. Different family members arc named according to the direction in which their reactions 
are typically catalyzed; thus they may be referred to as oxidoreductases, oxidases, reductases, or 

25 dehydrogenases. In addition, family members often have distinct cellular localizations, including the 
cytosol, the plasma membrane, mitochondrial inner or outer membrane, and peroxisomes. 

Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 
one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, pyrimidines, 
and the amino acid methionine. Tetrahydrofolate is generated by the activity of a holoenzyme complex 

30 called tetrahydrofolate synthase, which includes three enzyme activities: tetrahydrofolate 
dehydrogenase, tetrahydrofolate cyclohydrolase, and tetrahydrofolate synthetase. Thus, 
tetrahydrofolate dehydrogenase plays an important role in generating building blocks for nucleic and 
amino acids, crucial to proliferating cells. 

Intracellular redox status plays a critical role in the assembly of proteins. A major rate limiting 

35 step in protein folding is the thiol:disulfide exchange necessary for correct protein assembly. Although 
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incubation of reduced, unfolded proteins in buffers containing defined ratios of oxidized and reduced 
thiols can lead to folding into native conformation, the rate of folding is slow, and the attainment of the 
native conformation decreases proportionately with protein size and the number of cysteine residues. 
Certain cellular compartments such as the endoplasmic reticulum of eukaryotes and the periplasmic 
5 space of prokaryotes are maintained in a more oxidized state than the surrounding cytosol. Correct 
disulfide formation can occur in these compartments, but it occurs at a rate that is insufficient for 
normal cell processes and inadequate for synthesizing secreted proteins. 

Protein disulfide isomerases (PDIs), thioredoxins, and glutaredoxins are able to catalyze the 
formation of disulfide bonds and regulate the redox environment in cells to enable the necessary 

1 0 thiol :disulfide exchanges. Each of these classes of molecules has a somewhat different function, but all 
belong to a group of disulfide-containing redox proteins that contain a conserved active-site sequence 
and are ubiquitously distributed in eukaryotes and prokaryotes. PDIs are found in the endoplasmic . 
reticulum of eukaryotes and in the periplasmic space of prokaryotes. PDIs function by exchanging their 
own disulfide for thiols in a folding peptide chain. In contrast, reduced thioredoxins and glutaredoxins 

15 are generally found in the cytoplasm and function by directly reducing disulfides in the substrate 
proteins, Thioredoxin (Trx), a heat-stable, redox-active protein, contains an active site cysteine 
disulfide/dithiol. Oxidized thioredoxin, Trx-S, can be reduced to the dithiol form by NADPH and a 
specific flavoprotein enzyme, thioredoxin reductase. Reduced thioredoxin, Trx-(SH), participates in a 
number of redox reactions mostly linked to reduction of protein disulfides. Trx and thioredoxin 

20 reductase (TR), together with NADPH, form a redox complex in which TR catalyzes the electron 

transport from NADPH to Trx, The reduced thioredoxin then functions as an electron donor in a wide 
variety of different metabolic processes. 

Disulfide-containing redox proteins not only facilitate disulfide formation, but also regulate and 
participate in a wide variety of physiological processes. The thioredoxin system serves, for example, as 

25 a hydrogen donor for ribonucleotide reductase and controls the activity of enzymes by redox reactions. 
Mammalian thioredoxin (MT) acts as a hydrogen donor for ribonucleotide reductase and methionine 
sulfoxide reductase, facilitates refolding of disulfide-containing proteins, and activates the 
glucocorticoid and interleukin-2 receptors. MT also modulates the DNA binding activity of some 
transcription factors either directly (TFIIIC, BZLF1, and NF-kB) or indirectly (AP-1) through the 

30 nuclear factor Ref-1 . The importance of the redox regulation of transcription factors is exemplified by 
the v-fos oncogene where a point mutation of the thioredoxin-modulated cysteine residue results in 
constitutive activation of the AP-1 complex. Thioredoxin, secreted by cells using a leaderless pathway, 
stimulates the proliferation of lymphoid cells, fibroblasts, and a variety of human solid tumor cell lines. 
Furthermore, thioredoxin is an essential component of early pregnancy factor, inhibits human 

35 immunodeficiency virus expression in macrophages, reduces H 2 0 2 , scavenges free radicals, and protects 

2 



WO 01/44448 



PCT/US00/33158 



cells against oxidative stress (Abate, C. et al. (1990) Science 249: 1 157-1 161; Rosen, A. et al. (1995) 
InL Immunol. 7: 625-633; Tagaya, Y. et al. (1989) EMBO J. 8: 757-764; Newman, G. W. (1994) J. 
Expt. Med 180: 359-363; and Makino, Y. (1996) J. Clin. Invest. 98: 2469-2477). 

Short-chain alcohol dehydrogenases (SC ADs) are a family of dehydrogenases that share only 

5 15% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and the 
substrate binding domain. In addition to the well-known role in detoxification of ethanol, SCADs are 
also involved in synthesis and degradation of fatty acids, steroids, and some prostaglandins, and are 
therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD deficiency, 
and certain genetic disorders. For example, retinol dehydrogenase is a SC AD-family member (Simon, 

10 A. et al. (1995) J. Biol. Chem. 270:1 107-1 112) that converts retinol to retinal, the precursor of retinoic 
acid. Retinoic acid, a regulator of differentiation and apoptosis, has been shown to down-regulate genes 
involved in cell proliferation and inflammation (Chai, X. et al. (1995) J. Biol. Chem. 270:3900-3904). 
In addition, retinol dehydrogenase has been linked to hereditary eye diseases such as autosomal 
recessive childhood-onset severe retinal dystrophy (Simon, A. et al. (1996) Genomics 36:424-430). 

15 Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction of 

the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. (1991) 
Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ann. N.Y. Acad Sci. 664:415-424; Brown, 
J.K. and Imam, H. (1991) J. Inherit Metab. Dis. 14:436-458). Many pathways of neurotransmitter 
metabolism require oxidoreductase activity, coupled to reduction or oxidation of a cofactor, such as 

20 NAD7NADH (Newsholme, E.A. and Leech, A.R. (1983) Biochemistry for the Medical Sciences . John 
Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of catecholamines (epinephrine or 
norepinephrine) requires alcohol dehydrogenase (in the brain) or aldehyde dehydrogenase (in peripheral 
tissue). NAD* -dependent aldehyde dehydrogenase oxidizes 5-hydroxyindole-3-acetate (the product of 
5-hydroxytryptamine (serotonin) metabolism) in the brain, blood platelets, liver and pulmonary 

25 endothelium (Newsholme, E.A. and Leech, A.R. supra p. 786). Other neurotransmitter degradation 
pathways that utilize NADVNADH-dependent oxidoreductase activity include those of L-DOPA 
(precursor of dopamine, a neuronal excitatory compound), glycine (an inhibitory neurotransmitter in the 
brain and spinal cord), histamine (liberated from mast cells during the inflammatory response), and 
taurine (an inhibitory neurotransmitter of the brain stem, spinal cord and retina) (Newsholme, E.A. and 

30 Leech, A.R. supra , pp. 790, 792). Epigenetic or genetic defects in neurotransmitto- metabolic pathways 
can result in a spectrum of disease states in different tissues including Parkinson's disease and inherited 
myoclonus (McCance, K.L. and Huether, S.E. (1994) Pathophysiology . Mosby-Year Book, Inc., St. 
Louis, MO pp. 402-404; Gundlach, A.L. (1990) FASEB J. 4:2761-2766). 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 

35 catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of NAD 

3 
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to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3HACD and 
enoyl-CoA hydratase form an enzyme complex called bifunctional enzyme, defects in which are 
associated with peroxisomal bifunctional enzyme deficiency. This interruption in fatty acid metabolism 
produces accumulation of very-long chain fatty acids, disrupting development of the brain, bone, and 

5 adrenal glands. Infants born with this deficiency typically die within 6 months (Watkins, P. et al. 
(1989) J. Clin. Invest. 83:771-777; Online Mendelian Inheritance in Man (OMIM), #261515). The 
neurodegeneration that is characteristic of Alzheimer's disease involves development of extracellular 
plaques in certain brain regions. A major protein component of these plaques is the peptide amyloid-p 
(AP), which is one of several cleavage products of amyloid precursor protein (APP). 3HACD, which 

10 has been shown to bind the AP peptide, is overexpressed in neurons affected in Alzheimer's disease. In 
addition, an antibody against 3HACD can block the toxic effects of AP in a cell culture model of 
Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, #602057). 

17P«hydroxysteroid dehydrogenase (17pHSD6) plays an important role in the regulation of the 
male reproductive hormone, dihydrotestosterone (DHTT). 17PHSD6 acts to reduce levels of DHTT by 

15 oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuroni dated and removed 
from tissues. 17PHSD6 is active with both androgen and estrogen substrates when expressed in 
embryonic kidney 293 cells. At least five other isozymes of 17PHSD have been identified that catalyze 
oxidation and/or reduction reactions in various tissues with preferences for different steroid substrates 
(Biswas, M.G. and Russell, D.W. (1997) J. Biol. Cheia 272:15959-15966). For example, 17pHSDl 

20 preferentially reduces estradiol and is abundant in the ovary and placenta. 17PHSD2 catalyzes 
oxidation of androgens and is present in the endometrium and placenta. 17PHSD3 is exclusively a 
reductive enzyme in the testis (Geissler, W.M. et al. (1994) Nature Genet. 7:34-39). An excess of 
androgens such as DHTT can contribute to certain disease states such as benign prostatic hyperplasia 
and prostate cancer. 

25 Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 

common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes act 
upon cholesterol, including a number of dehydrogenases. One such dehydrogenase is 3-oxo-5-a-steroid 
dehydrogenase (OASD), a microsomal membrane protein highly expressed in prostate and other 
androgen-responsive tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, 

30 which is the most potent androgen. Dihydrotestosterone is essential for the formation of the male 

phenotype during embryogenesis, as well as for proper androgen-mediated growth of tissues such as the 
prostate and male genitalia. A defect in OASD that prevents the convo-sion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudohermaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S., et al. (1991) Nature 354:159-161; Labrie, F., et at. 

35 (1992) Endocrinology 131:1571-1573; OMIM #264600). Thus, OASD plays a central role in sexual 

4 
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differentiation and androgen physiology. 

The discovery of new oxidoreductase proteins and the polynucleotides encoding them satisfies a 
need in the art by providing new compositions which are useful in the diagnosis, prevention, and 
treatment of cell proliferative disorders including cancer; endocrine, metabolic, reproductive, 
5 neurological, viral, and autoimmune/inflammatory disorders, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of oxidoreductase 
proteins. 

SUMMARY OF THE INVENTION 

10 The invention features purified polypeptides, oxidoreductase proteins, referred to collectively as 

"ORP" and individually as "ORP-1," "ORP-2," "ORP-3," "ORP-4," "ORP-5," "ORP-6," "ORP-7," 
"ORP-8," "ORP-9," "ORP-10," "ORP-1 1," "ORP-12," "ORP-13," "ORP-14," "ORP-15," "ORP- 
16," "ORP-17," "ORP-18," "ORP-19," "ORP-20," "ORP-21," "ORP-22," "ORP-23," "ORP-24," 
"ORP-25," "ORP-26," "ORP-27." In one aspect, the invention provides an isolated polypeptide 

15 comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected 

20 from the group consisting of SEQ ID NO: 1-27. In one alternative, the invention provides an isolated 
polypeptide comprising the amino acid sequence of SEQ ID NO: 1-27. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-27, b) a naturally occurring amino acid sequence having at least 

25 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1 - 
27, c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -27. In one alternative, the polynucleotide encodes a polypeptide selected 
from the group consisting of SEQ ID NO: 1-27. In another alternative, the polynucleotide is selected 

30 from the group consisting of SEQ ID NO:28-54. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence having at least 90% 

35 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, c) 

5 
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a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27. In one alternative, the invention provides a cell transformed with the 
recombinant polynucleotide. In another alto-native, the invention provides a transgenic organism 

5 comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l-27, c) 

10 a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1 -27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27. The method comprises a) culturing a cell under conditions suitable for 
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide 
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 

15 recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1 -27, b) a naturally occurring, amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 

20 consisting of SEQ ID NO: 1 -27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO:l -27. 

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 

25 SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RN A equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
nucleotides. 

30 Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 

said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO: 28-54, c) a 

35 polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 

6 



WO 01/44448 



PCTAJS00/33158 



an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target polynucleotide or 
5 fragments thereof, and b) detecting the presence or absence of said hybridization complex, and 

optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 

10 selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 

15 fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the 
amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 

20 selected from the group consisting of SEQ ID NO: 1 -27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and a pharmaceutically acceptable excipient In one 

25 embodiment, the composition comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27. The invention additionally provides a method of treating a disease or condition 
associated with decreased expression of functional ORP, comprising administering to a patient in need 
of such treatment the composition. 

The invention also provides a method for screening a compound for effectiveness as an 

30 agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ED NO: 1-27, and d) an immunogenic fragment of an amino 

35 acid sequence selected from the group consisting of SEQ ID NO: 1-27. The method comprises a) 

7 
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exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the 
sample. In one alternative, the invention provides a composition comprising an agonist compound 
identified by the method and a pharmaceutical^ acceptable excipient In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 

5 ftinctional ORP, comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO:l-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

10 selected from the group consisting of SEQ ID NO:l-27, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27. The method 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alto-native, the invention provides a composition comprising 

15 an antagonist compound identified by the method and a pharmaceutical^ acceptable excipient. In 
another alternative, the invention provides a method of treating a disease or condition associated with 
overexpression of functional ORP, comprising administering to a patient in need of such treatment the 
composition. 

The invention further provides a method of screening for a compound that specifically binds 
20 to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid 
25 sequence selected from the group consisting of SEQ ID NO:l -27. The method comprises a) combining 
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

The invention further provides a method of screening tor a compound that modulates the 
30 activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic 
35 fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -27. The 
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method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 

5 change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO: 28-54, the method comprising a) 

10 exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 

15 contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID 
NO: 28-54, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 

20 and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 
ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

25 polynucleotide sequence selected from the group consisting of SEQ ID NO: 28-54, iii) a 

polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of 
a polynucleotide sequence selected from the group consisting of i)-v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the 

30 treated biological sample with the amount of hybridization complex in an untreated biological 

sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

35 Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
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clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding ORP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of ORP. 
5 Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 

patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding ORP were isolated. 
10 Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 

polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
15 that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
20 and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
25 as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
30 connection with the inventioa Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"ORP" refers to the amino acid sequences of substantially purified ORP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
35 human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 
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The tenn "agonist" refers to a molecule which intensifies or mimics the biological activity of 
ORP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of ORP either by directly interacting with ORP 
or by acting on components of the biological pathway in which ORP participates. 

5 An "allelic variant" is an alternative form of the gene encoding ORP. Allelic variants may 

result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered A gene may have none, one, or 
many allelic variants of its naturally occurring form Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 

10 Each of these types of changes may occur alone, or in combination with the others, one or mare times in 
a given sequence. 

"Altered" nucleic acid sequences encoding ORP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as ORP or a 
polypeptide with at least one Junctional characteristic of ORP. Included within this definition are 

15 polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding ORP, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding ORP. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent ORP. Deliberate 

20 amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 

hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of ORP is retained For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 

25 include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 

30 molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 

35 in the art. 
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The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
ORP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of ORP either by 
directly interacting with ORP or by acting on components of the biological pathway in which ORP 
5 participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant 
Antibodies that bind ORP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 

10 to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

15 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

20 The term "antisense" refers to any composition capable of base-pairing with the "sense" 

(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or 2-methoxyethoxy sugars; or oligonucleotides having 

25 modified bases such as 5 -methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 

30 designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise* "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic ORP, or of any oligopeptide thereof, to 
induce a specific immune response in appropriate animals or cells and to bind with specific antibodies. 

35 "Complementary" describes the relationship between two single-stranded nucleic acid 
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sequences that anneal by base-pairing. For example, 5'-AGT-3* pairs with its complement, 
3'-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 

5 amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding ORP or fragments of ORP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 

10 SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems, 
Foster City CA) in the 5* and/or the 3' direction, and resequence^ or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 

15 fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 

20 protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 


25 


Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 


30 


Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




He 


Leu, Val 




Leu 


He, Val 


35 


Lys 


Arg, Gin, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 


40 


Trp 


Phe, Tyr 
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Tyr His, Phe, Trp 
Val He, Leu, Thr 

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
5 backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

10 The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 

modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 

1 5 immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of ORP or the polynucleotide encoding ORP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 

20 to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 

25 example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

30 A fragment of SEQ ID NO:28-54 comprises a region of unique polynucleotide sequence that 

specifically identifies SEQ ID NO:28-54, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:28-54 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish * 
SEQ ID NO:28-54 from related polynucleotide sequences. The precise length of a fragment of SEQ 

35 ID NO:28-54 and the region of SEQ ID NO:28-54 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment 
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A fragment of SEQ ID NO:l-27 is encoded by a fragment of SEQ ID NO:28-54. A fragment 
of SEQ ID NO: 1-27 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO:l-27. For example, afragment of SEQ ID NO:l-27 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-27. The precise length of 

5 a fragment of SEQ ID NO: 1-27 and the region of SEQ ID NO: 1 -27 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment 

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codoa A "fuB- 

10 length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 

15 standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3. 1 2e sequence 

20 alignment program. This program is part of the L ASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 

25 selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (AlLschul, S.F. et al. (1990) J. MoL Biol. 215:403-410), which is available from several 

30 sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 

35 Sequences" can be accessed and used interactively at httpy/www.ncbi.nlnLnih.gov/gorf^bl2.html. The 
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"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 
10 Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 

15 the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

20 Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same proteia 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 

25 the percentage of residue matches between at least two polypeptide sequences aligned using a 

standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and hydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

30 Percent identity between polypeptide sequences may be determined using the default parameters 

of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e sequence alignment 
program (described and referenced above). For pair wise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters arc set as follows: Ktuple=l , gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 

35 polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
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between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.12 
(Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 

Gap x drop off: 50 

Expect: 10 

Word Size: 3 
10 Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 

1 5 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 

20 chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 

25 complementary strand through base pairing under defined hybridization conditions. Specific 

hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form undo- permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 

30 binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and tho-efore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 

35 SDS, and about 100 fig/ml sheared, denatured salmon sperm DNA. 

17 



WO 01/44448 



PCT/US00/33158 



Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
5 target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al, 1989, Molecular 
aoning: A Laboratory Manual . 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 

10 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 ng/ml. Organic solvent, such as 

15 formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RN A:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

20 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g. 5 C 0 t or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 

25 or their nucleic acids have been fixed). 

Hie words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 

30 various factors, c.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of ORP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 

35 ORP which is useful in any of the antibody production methods disclosed herein or known in the art. 
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The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array dement" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
5 The tern "modulate" refers to a change in the activity of ORP. For example, modulation may 

cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of ORP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
10 synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
15 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
20 preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an ORP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
25 cell type depending on the enzymatic milieu of ORP. 

"Probe" refers to nucleic acid sequences encoding ORP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
30 short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may thai be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic add 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
35 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
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be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used 

5 Methods for preparing and using probes and primers are described in the references, for 

example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual 2 nd ed, vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, RM. et al. (1987) Current Protocols in Molecular 
Biology. Greene Publ. Assoc. & Wilcy-Interscicnces, New York NY; Innis, M. et al. (1990) PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer pairs 

10 can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

15 100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 

20 and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two prima* selection programs may also be obtained from 

25 their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 

30 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

35 A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

20 



WO 01/44448 



PCTYUS00/33158 



that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
5 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
10 expressed, inducing a protective immunological response in the mammal. 

A "regulatory dement" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, ami 5' and 3' untranslated regions (UTRs). 
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

15 "Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 

20 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding ORP, or fragments thereof, or ORP itself, may comprise a bodily fluid; an extract from 

25 a cell, chromosome, organdie, or membrane isolated from a cdl; a cdl; genomic DNA, RNA, or 
cDN A, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protdn or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding compositioa The interaction is dependent upon the presence of a particular structure 

30 of the protan, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 

example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucldc add or amino acid sequences that are 

35 removed from their natural environment and are isolated or separated, and are at least 60% free, 
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preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

5 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, Alters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 

10 or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 

15 of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as weD as transiently transformed 
cells which express the inserted DNA or RNA for limited periods of time. 

20 A "transgenic organism," as used herein, is any organism, including but not limited to 

animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
ait. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 

25 a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 

30 transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook, J. et al. (1989), 
supra. 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
35 nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
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set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity ova* a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant A splice variant may have significant 

5 identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 

10 variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease slate, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

1 5 least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 95%. or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

20 THE INVENTION 

The invention is based on the discovery of new human oxidoreductase proteins (ORP), the 
polynucleotides encoding ORP, and the use of these compositions for the diagnosis, treatment, or 
prevention of cell proliferative disorders including cancer; endocrine, metabolic, reproductive, 
neurological, viral, and autoimmune/inflammatory disorders. 

25 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding ORP. 

Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and 
nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic 
acids encoding each ORP were identified, and column 4 shows the cDNA libraries from which these 
clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA libraries. Clones 

30 for which cDNA libraries are not indicated were derived from pooled cDNA libraries. In some cases, 
GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank cDNA 
sequences, where indicated, in column 5 were used to assemble the consensus nucleotide sequence of 
each ORP and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 

35 column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
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polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylate 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods 
and in some cases, searchable databases to which the analytical methods were applied. The methods of 

5 column 7 were used to characterize each polypeptide through sequence homology and protein motifs. 
The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding ORP. The first column of Table 3 lists the nucleotide 
SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:28-54 and 

10 to distinguish between SEQ ID NO:28-54 and related polynucleotide sequences. The polypeptides 
encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
categories which express ORP as a fraction of total tissues expressing ORP. Column 4 lists diseases, 
disorders, or conditions associated with those tissues expressing ORP as a fraction of total tissues 
expressing ORP. Column 5 lists the vectors used to subclone each cDNA library. 

15 The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 

from which cDN A clones encoding ORP were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

SEQ ID NO:31 maps to chromosome 3 within the interval from 23.2 to 31.4 centiMorgans. 

20 SEQ ID NO:42 maps to chromosome 22 within the interval from 0 to 40.2 centiMorgans. SEQ ED 
NO:48 maps to chromosome 7 within the interval from 100.5 to 114.5 centiMorgans, to chromosome 
17 within the interval from 67.6 to 69.3 centiMorgans, and to chromosome 17 within the interval 
from 83.8 centiMorgans to the q terminus. SEQ ID NO:53 maps to chromosome 1 1 within the 
interval from 64.9 to 70.9 centiMorgans. 

25 The invention also encompasses ORP variants. A preferred ORP variant is one which has at 

least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the ORP amino acid sequence, and which contains at least one functional or structural 
characteristic of ORP. 

The invention also encompasses polynucleotides which encode ORP. In a particular 

30 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
the group consisting of SEQ ID NO:28-54, which encodes ORP. The polynucleotide sequences of SEQ 
ID NO:28-54, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherein 
occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 

35 The invention also encompasses a variant of a polynucleotide sequence encoding ORP. In 
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particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding ORP. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:28-54 which has at 
5 least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:28-54. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of ORP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 

10 code, a multitude of polynucleotide sequences encoding ORP, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 

15 occurring ORP, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode ORP and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring ORP undo- appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding ORP or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 

20 codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 

particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding ORP 
and its derivatives without altering the encoded amino acid sequences include the production of RNA 
transcripts having more desirable properties, such as a greater half-life, than transcripts produced from 

25 the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode ORP and ORP 
derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using 
reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into 

30 a sequence encoding ORP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:28-54 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

35 152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
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"Definitions." 

Methods for DN A sequencing are well known in the art and may be used to practice any of the 
embodiments of the inventioa The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 

5 Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway 
NJ), or combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the M1CROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 

10 800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 
377 DNA sequencing system (Applied Biosystems), the MEG AB ACE 1000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences are 
analyzed using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. 
(1997) Short Protocols in Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, 

15 R.A. (1995) Molecular Biology and Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding ORP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory dements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 

20 DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:31 8-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 1 6:8 1 86.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 

25 to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:1 1 1-1 19.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

30 Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto C A) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Prima- Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 

35 GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
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72°C 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 

5 does not yield a full-length cDNA Genomic libraries may be useful for extension of sequence into 5* 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

1 0 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled Capillary electrophoresis is especially preferable for sequencing small DNA fragments 

1 5 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode ORP may be cloned in recombinant DNA molecules that direct expression of ORP, or fragments 
or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 

20 amino acid sequence may be produced and used to express ORP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alto* ORP-encoding sequences for a variety of purposes including, but not 
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 

25 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 

mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 

30 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1 999) Nat 
Biotechnol. 17:259-264; and Crameri, A et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve the biological properties of ORP, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 

35 subjected to selection or screening procedures that identify those gene variants with the desired 
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properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
5 optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding ORP may be synthesized, in whole or in part, 

10 using chemical methods well known in the art (See, e.g„ Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
ORP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins. Structures and Molecular Properties, WH Freeman, New York NY, pp. 

15 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the AB1 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of ORP, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

20 The peptide may be substantially purified by preparative high performance liquid 

chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421 .) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active ORP, the nucleotide sequences encoding ORP or 

25 derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5* and 3' untranslated regions in the vector and in polynucleotide sequences 
encoding ORP. Such elements may vary in their strength and specificity. Specific initiation signals 

30 may also be used to achieve more efficient translation of sequences encoding ORP. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding ORP and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 

35 translational control signals including an in-frame ATG initiation codon should be provided by the 
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vector. Exogenous translation^ dements and initiation codons may be of various origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

5 Methods which are well known to those skilled in the art may be used to construct expression 

vectors containing sequences encoding ORP and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloninc. A Laboratory 
Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubd, F.M. et al. (1995) 

10 Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding ORP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 

15 plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic vims, TMV) or with bacterial expression vectors (e.g M Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. 

20 Acad. Sci. USA 91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-311; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81 :3655-3659; and Harrington, 

25 J.J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 

adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 
USA 90(13):6340-6344; Bullcr, R.M. et al. (1985) Nature 31 7(6040):8 13-815; McGregor, D.P. et al. 

30 (1994) Mol. Immunol. 31(3):219-226; and Vcrma, LM. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended fa* polynucleotide sequences encoding ORP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding ORP can be achieved using a 

35 multifunctional E. coli vector such as PBLUESCRffT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
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(Life Technologies). Ligation of sequences encoding ORP into the vector's multiple cloning site 
disrupts the lacL gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 
5 deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of ORP are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of ORP may be used For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be used 

Yeast expression systems may be used for production of ORP. A number of vectors containing 
10 constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH promoters, may be 
used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such vectors direct either the 
secretion or intracellular mention of expressed protons and enable integration of foreign sequences into 
the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra ; Bitter, supra ; and Scorer, 
supra .) 

15 Plant systems may also be used for expression of ORP. Transcription of sequences encoding 

ORP may be driven viral promoters, eg., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used (See, e.g., Coruzzi, supra ; Broglie, supra ; and Winter, supra .) These constructs can be 

20 introduced into plant ceils by direct DNA transformation or pathogen-mediated transfection. (See, e.g., 
The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized In cases 
where an adenovirus is used as an expression vector, sequences encoding ORP may be ligated into an 

25 adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses ORP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. Natl. 
Acad Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus 
(RS V) enhancer, may be used to increase expression in mammalian host cells. SV40 or EB V-based 

30 vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, poiycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355.) 

35 For long term production of recombinant proteins in mammalian systems, stable expression of 
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ORP in cell lines is preferred For example, sequences encoding ORP can be transformed into cell lines 
using expression vectors which may contain viral origins of replication and/or endogenous expression 
elements and a selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before being 
5 switched to selective media. The purpose of the selectable marker is to confer resistance to a selective 
agent, and its presence allows growth and recovery of cells which successfully express the introduced 
sequences. Resistant clones of stably transformed cells may be propagated using tissue culture 
techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 

10 but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk' and apf cells, respectively. (See, e.g„ Wigler, M et al. (1977) Cell 1 1 :223-232; 
Lowy, I. et aL (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 

15 chlorsulftiron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Nad. Acad Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Nad. Acad. Sci. USA 
85:8047-8051.) Visible markers, e,g., anthocyanins, green fluorescent proteins (GFP; Clontech), 6 

20 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate lucifexin may be used 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C. A. 
(1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 

25 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding ORP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding ORP can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding ORP under the control of a single 
promoter. Expression of the marks gene in response to induction or selection usually indicates 

30 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding ORP and that express 
ORP may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 

35 technologies for the detection and/or quantification of nucleic acid or protein sequences. 
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Immunological methods for detecting and measuring the expression of ORP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELIS As), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 

5 antibodies reactive to two non-interfering epitopes on ORP is preferred, but a competitive binding assay 
may be employed. These and other assays are well known in the art (See, e.g., Hampton, R. et aL 
(1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, Sect IV; Coligan, J.E. et 
al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 

10 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding ORP include oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding ORP, or any fragments thereof, may be cloned into a vector for the production of 

1 5 an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, cm- SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include 

20 radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding ORP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellular^ depending on the sequence 

25 and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode ORP may be designed to contain signal sequences which direct secretion 
of ORP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashioa Such modifications of the 

30 polypeptide include, but are not limited to, acetylation, carboxylation, glycosylarion, phosphorylation, 
lipidation, and acylatioa Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 

35 Collection (ATCC, Manassas V A) and may be chosen to ensure the correct modification and processing 

32 



WO 01/44448 PC17US00/33158 

of the foreign proteia 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding ORP may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric ORP protein containing a 

5 heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of ORP activity. Heterologous protein and peptide moieties 
may also facilitate purification of fusion proteins using commercially available affinity matrices. Such 
moieties include, but are not limited to, glutathione S-transferase (GST), maltose binding protein 
(MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 

10 (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on 
immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 

15 located between the ORP encoding sequence and the heterologous protein sequence, so that ORP may 
be cleaved away from the heterologous moiety following purificatioa Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled ORP may be achieved in 

20 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

ORP of the present invention or fragments thereof may be used to screen for compounds that 

25 specifically bind to ORP. At least one and up to a plurality of test compounds may be screened for 
specific binding to ORP. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
ORP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 

30 natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which ORP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express ORP, either as a secreted 

35 protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or E. 
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coli . Cells expressing ORP or cell membrane fractions which contain ORP are then contacted with a 
test compound and binding, stimulation, or inhibition of activity of either ORP or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
5 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with ORP, either in 
solution or affixed to a solid support, and detecting the binding of ORP to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
10 libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

ORP of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of ORP. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for ORP 

15 activity, wherein ORP is combined with at least one test compound, and the activity of ORP in the 
presence of a test compound is compared with the activity of ORP in the absence of the lest 
compound. A change in the activity of ORP in the presence of the test compound is indicative of a 
compound that modulates the activity of ORP. Alternatively, a test compound is combined with an in 
vitro or cell-free system comprising ORP under conditions suitable for ORP activity, and the assay is 

20 performed. In either of these assays, a test compound which modulates the activity of ORP may do 
so indirectly and need not come in direct contact with the test compound. At least one and up to a 
plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding ORP or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 

25 (ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, MR. 

30 (1989) Science 244: 1288-1 292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

35 blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
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to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding ORP may also be manipulated in vitro in ES cells derived from 

5 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1 147). 

Polynucleotides encoding ORP can also be used to create "knockin" humanized animals 

10 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding ORP is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 

15 Alternatively, a mammal inbred to overexpress ORP, e.g., by secreting ORP in its milk, may also 
serve as a convenient source of that protein (Janne, J. ct al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of ORP and oxidoreductase proteins. In addition, the expression of ORP is closely 

20 associated with cancerous, inflamed, reproductive, and gastrointestinal tissues. Therefore, ORP 
appears to play a role in cell proliferative disorders including cancer; endocrine, metabolic, 
reproductive, neurological, viral, and autoinununefinflammatory disorders. In the treatment of 
disorders associated with increased ORP expression or activity, it is desirable to decrease the 
expression or activity of ORP. In the treatment of disorders associated with decreased ORP 

25 expression or activity, it is desirable to increase the expression or activity of ORP. 

Therefore, in one embodiment, ORP or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of ORP. 
Examples of such disorders include, but are not limited to, a cell proliferative disorder, such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 

30 disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia; a cancer, such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 
muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, 

35 thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and pituitary 
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resulting from lesions such as primary brain tumors, adenomas, infarction associated with pregnancy, 
hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, immunological 
disorders, and complications due to head trauma, disorders associated with hypopituitarism including 
hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian 
5 disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism, disorders associated 
with hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic 
hormone (ADH) secretion (SIADH) often caused by benign adenoma, disorders associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), 

10 and cretinism, disorders associated with hyperthyroidism including thyrotoxicosis and its various 
forms, Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and 
Plummer's disease, disorders associated with hyperparathyroidism including Conn disease (chronic 
hypercalemia), pancreatic disorders such as Type I or Type II diabetes mellitus and associated 
complications, disorders associated with the adrenals such as hyperplasia, carcinoma, or adenoma of 

15 the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease, disorders associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic 
ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 

20 . hermaphroditism, hirsutism and virilization, breast cancer, and, in post-menopausal women, 

osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, 
hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated with 
absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder, 
such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirrhosis, galactosemia, goiter, 

25 hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, 
hyperthyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid myopathies, obesity, 
lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, pseudovitamin D-deficiency 
rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a reproductive disorder, such as 
disorders of prolactin production, infertility, including tubal disease, ovulatory defects, and 

30 endometriosis, disruptions of the estrous cycle, disruptions of the menstrual cycle, polycystic ovary 
syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, uterine fibroids, 
autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the breast, fibrocystic breast 
disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm physiology, cancer of the 
testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, 

35 carcinoma of the male breast, and gynecomastia; a neurological disorder, such as epilepsy, ischemic 
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cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other dcmyclinating diseases, bacterial and viral 

5 meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuni, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hcmangioblastomatosis, encephalotrigeminal syndrome, mental 

10 retardation and other developmental disorders of the central nervous system, cerebral palsy, 

neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 

15 seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; an 
autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 

20 autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 

25 myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reitcr's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 

30 trauma; and a viral disorder, such as viral infections, e.g., those caused by adenoviruses (acute 
respiratory disease, pneumonia), arenaviruses (lymphocytic choriomeningitis), bunyaviruses 
(Hantavirus), coronaviruscs (pneumonia, chronic bronchitis), hepadnaviruses (hepatitis), 
herpesviruses (herpes simplex virus, varicella-zoster virus, Epstein-Barr vims, cytomegalovirus), 
flaviviruses (yellow fever), orthomyxoviruses (influenza), papillomaviruses (cancer), 

35 paramyxoviruses (measles, mumps), picornoviruses (rhinovirus, poliovirus, coxsackie-virus), 
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polyomavinises (BK virus, JC virus), poxviruses (smallpox), reovirus (Colorado tick fever), 
retroviruses (human immunodeficiency vims, human T lymphotropic virus), rhabdoviruses (rabies), 
rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). 

In another embodiment, a vector capable of expressing ORP or a fragment or derivative thereof 
5 may be administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of ORP including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified ORP in conjunction 
with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder 
associated with decreased expression or activity of ORP including, but not limited to, those provided 
10 above. 

In still another embodiment, an agonist which modulates the activity of ORP may he 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of ORP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of ORP may be administered to a subject to treat or 

15 prevent a disorder associated with increased expression or activity of ORP. Examples of such 

disorders include, but are not limited to, those cell proliferative disorders including cancer; endocrine, 
metabolic, reproductive, neurological, viral, and autoimmune/inflammatory disorders described above. 
In one aspect, an antibody which specifically binds ORP may be used directly as an antagonist or 
indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues 

20 which express ORP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding ORP may be administered to a subject to treat or prevent a disorder associated with increased 
expression or activity of ORP including, but not limited to, those described above. 

In other embodiments, any of the proteias, antagonists, antibodies, agonists, complementary 

25 sequences, or vectors of the invention may be administered in combination with other appropriate 

therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 

30 lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of ORP may be produced using methods which are generally known in the art. 
In particular, purified ORP may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind ORP. Antibodies to ORP may also be generated using 
methods that are well known in the art. Such antibodies may include, but are not limited to, polyclonal, 

35 monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 
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expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally 
preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with ORP or with any fragment or oligopeptide thereof 

5 which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli C almette-Guerin) and Corvnebacterium parvum are especially preferable. 

10 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to ORP 

have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of ORP amino 
acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric molecule 

15 may be produced. 

Monoclonal antibodies to ORP may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cen hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., KoMer, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

20 Immunol. Methods 81 :31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. ct al. (1984) Proc. 

25 Natl. Acad. Sci. USA 81 :685 1-6855; Neuberger, M.S. et al. (1984) Nature 31 2:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce ORP-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. 

30 (1991) Proc. Natl. Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

35 Antibody fragments which contain specific binding sites for ORP may also be generated For 
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example, such fragments include, but are not limited to, F(at0 2 fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the F(ab")2 
fragments. Alternativdy, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity, (See, e.g., Huse, W.D. et al. 

5 (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between ORP and its specific 

10 antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering ORP epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for ORP. Affinity is expressed as an association 

15 constant, K., which is defined as the molar concentration of ORP-antibody complex divided by the 

molar concentrations of free antigen and free antibody under equilibrium conditions. The' determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple ORP 
epitopes, represents the average affinity, or avidity, of the antibodies for ORP. The determined for a 
- preparation of monoclonal antibodies, which are monospecific for a particular ORP epitope, represents 

20 a true measure of affinity. High-affinity antibody preparations with ranging from about 10 9 to 10 12 
L/mole are preferred for use in immunoassays in which the ORP-antibody complex must withstand 
rigorous manipulations. Low-affinity antibody preparations with K, ranging from about 10 6 to 10 7 
L/mole are preferred for use in immunopurification and similar procedures which ultimately require 
dissociation of ORP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, 

25 Volume I: A Practical Approach . IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A 
Practical Guide to Monoclonal Antibodies. John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be turther evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 

30 specific antibody/ml, is generally employed in procedures requiring precipitation of ORP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
Coliganetal.. supra .) 

In another embodiment of the invention, the polynucleotides encoding ORP, or any fragment or 
35 complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
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expression can be achieved by designing complementary sequences or antisense molecules (DN A, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding ORP. 
Such technology is well known in the art, and antisense oligonucleotides or largo- fragments can be 
designed from various locations along the coding or control regions of sequences encoding ORP. (See, 
5 e.g., Agrawal, S., ed. (1 996) Antisense Therapeutics. Humana Press Inc., Totawa N J.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 

10 Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9( 1 3) : 1 288-1 296.) Antisense sequences can also be introduced intracellular^ through the use of viral 
vectors, such as retrovirus and adeno- associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 

15 systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14);2730-2736.) 

In another embodiment of the invention, polynucleotides encoding ORP may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 

20 (e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 

25 Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 

30 against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruz.O . In the 
case where a genetic deficiency in ORP expression or regulation causes disease, the expression of ORP 

35 from an appropriate population of transduced cells may alleviate the clinical manifestations caused by 
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the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in ORP 
are treated by constructing mammalian expression vectors encoding ORP and introducing these vectors 
by mechanical means into ORP-deficient cells. Mechanical transfer technologies for use with cells in 
5 vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold particle 
delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of 
DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191-217; Ivies, 
Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R6cipon (1998) Curr. Opia Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of ORP include, but are not limited 

10 to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCR1PT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). ORP may be expressed 
using (i) a constitutivdy active promoter, (e.g„ from cytomegalovirus (CMV), Rous sarcoma virus 
(RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g., the 

15 tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 

89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:45 1 -456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysonc-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M. V. 

20 and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding ORP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 

25 parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 

30 respect to ORP expression arc treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding ORP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
dement (RRE) along with additional retrovirus as-acting RNA sequences and coding sequences 
required for efficient vector propagatioa Retrovirus vectors (e.g., PFB and PFBNEO) are 

35 commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
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Natl. Acad Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
5 A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), and the 

10 return of transduced cells to a patient are procedures well known to parsons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 
(1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 

15 polynucleotides encoding ORP to cells which have one or more genetic abnormalities with respect to the 
expression of ORP. The construction and packaging of adenovirus-based vectors are well known to 
those with ordinary skill in the art Replication defective adenovirus vectors have proven to be versatile 
for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. 
et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. 

20 Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated 
by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr. 
19:51 1-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 
reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
25 polynucleotides encoding ORP to target cells which have one or more genetic abnormalities with respect 
to the expression of ORP. The use of hopes simplex virus (HS V)-based vectors may be especially 
valuable for introducing ORP to cells of the central nervous system, for which HS V has a tropism. The 
construction and packaging of herpes-based vectors are well known to those with ordinary skill in the 
art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has been used to deliver a 
30 reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 169:385-395). The 

construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413 
to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby incorporated by 
reference. U.S. Patent Number 5,804,41 3 teaches the use of recombinant HSV d92 which consists of a 
genome containing at least one exogenous gene to be transferred to a cell under the control of the 
35 appropriate promoter for purposes including human gene therapy. Also taught by this patent are the 
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construction and use of recombinant HS V strains deleted for ICP4, ICP27 and ICP22. For HS V 
vectors, see also Coins, W.R et aL (1999) J. Virol. 73:519-532 andXu, H. et al. (1994) Dev. Biol. 
163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus sequences, the 
generation of recombinant virus following the transfection of multiple plasmids containing different 
5 segments of the large herpesvirus genomes, the growth and propagation of herpesvirus, and the 
infection of cells with herpesvirus are techniques well known to those of ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding ORP to target cells. The biology of the prototypic alphavirus, Semliki 
Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on the SFV 

10 genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During alphavirus RNA 
replication, a subgenomic RNA is generated that normally encodes the viral capsid proteins. This 
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the 
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease 
and polymerase). Similarly, inserting the coding sequence for ORP into the alphavirus genome in place 

15 of the capsid-coding region results in the production of a large number of ORP-coding RN As and the 
synthesis of high levels of ORP in vector transduced cells. While alphavirus infection is typically 
associated with cell lysis within a few days, the ability to establish a persistent infection in hamster 
normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication 
of alphaviruscs can be altered to suit the needs of the gene therapy application (Dryga, S. A. et al. 

20 (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of ORP 
into a variety of cell types. The specific transduction of a subset of cells in a population may require 
the sorting of cells prior to transduction. The methods of manipulating infectious cDNA clones of 
alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus 
infections, are well known to those with ordinary skill in the art. 

25 Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 

and +10 from the start site, may also be employed to inhibit gene expressioa Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 

30 been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches , Futura Publishing, Mt. KiscoNY, pp. 163-177.) A 
complementary sequence or anti sense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

35 RNA The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
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molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding ORP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

5 scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 

10 oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 

15 encoding ORP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable 
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends 

20 of the molecule, or the use of phosphorothioate or T Omethyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

25 An additional embodiment of the invention encompasses a method for screening for a 

compound which is effective in altering expression of a polynucleotide encoding ORP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolccular 

30 chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased ORP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding ORP may be therapeutically useful, and in the treament of disorders associated with 

35 decreased ORP expression or activity, a compound which specifically promotes expression of the 
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polynucleotide encoding ORP may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
5 altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding ORP is exposed to at least one test compound thus obtained. The sample 

10 may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding ORP are assayed by 
any method commonly known in the art Typically, the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding ORP. The amount of hybridization may be quantified, thus forming the 

15 basis for a comparison of (he expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pombe gene expression system (Atkins, 

20 D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, CM. et al. (2000) Nucleic Acids Res. 28:E15) or a 
human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 
268:8-13). A particular embodiment of the present invention involves screening a combinatorial 
library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and 
modified oligonucleotides) for antisensc activity against a specific polynucleotide sequence (Bruice, 

25 T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 

30 using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

35 An additional embodiment of the invention relates to the administration of a composition which 
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generaUy comprises an active ingredient formulated with a pharmaceutical^ acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of ORP, 
5 antibodies to ORP, and mimetics, agonists, antagonists, or inhibitors of ORP. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 
10 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 

These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 

formulations is well-known in the art In the case of macromolecules (e.g. largo* peptides and proteins), 

< 

recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled 
15 the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. et al., U.S. 

Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 

injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 

ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
20 an effective dose is well within the capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 

macromolecules comprising ORP or fragments thereof. For example, liposome preparations containing 

a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 

macromolecule. Alternatively, ORP or a fragment thereof may be joined to a short cationic N-terminal 
25 portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to transduce into 

the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et al. (1999) 

Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 

culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
30 or pigs. An animal model may also be used to determine Uie appropriate concentration range and route 

of administration. Such information can then be used to determine useful doses and routes for 

administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example ORP or 

fragments thereof, antibodies of ORP, and agonists, antagonists or inhibitors of ORP, which 
35 ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
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standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD 50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDst/ED^ ratio. Compositions which exhibit large 
5 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED 50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 

10 The exact dosage will be determined by the practitioner, in light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide sufficient levds of the active 
moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination^), reaction sensitivities, and response to therapy. 

15 Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 g to 1 00,000 g, up to a total dose of 
about 1 gram, depending upon the route of administratioa Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 

20 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind ORP may be used for the diagnosis 
25 of disorders characterized by expression of ORP, or in assays to monitor patients being treated with 
ORP or agonists, antagonists, or inhibitors of ORP. Antibodies useful for diagnostic purposes may be 
prepared in the same manna- as described above for therapeutics. Diagnostic assays for ORP include 
methods which utilize the antibody and a label to detect ORP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be labeled by 
30 covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, 
several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring ORP, including ELIS As, RIAs, and FACS, are known in 
the art and provide a basis for diagnosing altered or abnormal levels of ORP expression. Normal or 
standard values for ORP expression are established by combining body fluids or cell extracts taken 
35 from normal mammalian subjects, for example, human subjects, with antibody to ORP under conditions 

48 



WO 01/44448 



PCTYUS00/33158 



suitable for complex formatioa The amount of standard complex formation may be quantitated by 
various methods, such as photometric means. Quantities of ORP expressed in subject, control, and 
disease samples from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 

5 In another embodiment of the invention, the polynucleotides encoding ORP may be used for 

diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of ORP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 

10 ORP, and to monitor regulation of ORP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding ORP or closely related molecules may be used to 
identify nucleic acid sequences which encode ORP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g„ a 

15 conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding ORP, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the ORP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:28-54 or from 

20 genomic sequences including promoters, enhancers, and introns of the ORP gene. 

Means for producing specific hybridization probes for DNAs encoding ORP include the cloning 
of polynucleotide sequences encoding ORP or ORP derivatives into vectors for the production of 
mRNA probes. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the 

25 appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, 
for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding ORP may be used for the diagnosis of disorders associated 
with expression of ORP. Examples of such disorders include, but are not limited to, a cell proliferative 

30 disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, 
psoriasis, primary thrombocythemi a; a cancer, such as adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, 

35 kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
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testis, thymus, thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and 
pituitary resulting from lesions such as primary brain tumors, adenomas, infarction associated with 
pregnancy, hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, 
immunological disorders, and complications due to head trauma, disorders associated with 
5 hypopituitarism including hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 
Hand-Schuller-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 
dwarfism, disorders associated with hypopituitarism including acromegaly, giantism, and syndrome 
of inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by benign adenoma, 
disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis associated 

10 with bacterial infection, subacute thyroiditis associated with viral infection, autoimmune thyroiditis 
(Hashimoto's disease), and cretinism, disorders associated with hyperthyroidism including 
thyrotoxicosis and its various forms, Grave's disease, pretibial myxedema, toxic multinodular goiter, 
thyroid carcinoma, and Plummets disease, disorders associated with hyperparathyroidism including 
Conn disease (chronic hypercalemia), pancreatic disorders such as Type I or Type TI diabetes mellitus 

15 and associated complications, disorders associated with the adrenals such as hyperplasia, carcinoma, 
or adenoma of the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, 
Cushing's disease, Liddle's syndrome, and Arnold-Healy-Gordon syndrome, pheochromocytoma 
tumors, and Addison's disease, disorders associated with gonadal steroid hormones such as: in 
women, abnormal prolactin production, infertility, endometriosis, perturbations of the menstrual 

20 cycle, polycystic ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, 
galactorrhea, hermaphroditism, hirsutism and virilization, breast cancer, and, in post-menopausal 
women, osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell 
aplasia, hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated 
with absence of androgen receptors, syndrome of 5 oc-reductase, and gynecomastia; a metabolic 

25 disorder, such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirrhosis, galactosemia, 
goiter, hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, 
hypercholesterolemia, hyperthyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid 
myopathies, obesity, lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, 
pseudovitamin D-deficiency rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a 

30 reproductive disorder, such as disorders of prolactin production, infertility, including tubal disease, 
ovulatory defects, and endometriosis, disruptions of the estrous cycle, disruptions of the menstrual 
cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian 
tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm 

35 physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, 
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Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; a neurological 
disorder, such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 

5 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 

10 nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 

1 5 endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; an autoimmune/inflammatory disorder, such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 

20 allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, . 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 

25 syndrome, gout, Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel 
syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardia] inflammation, 
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid 
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 

30 complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, and trauma; and a viral disorder, such as viral 
infections, e.g., those caused by adenoviruses (acute respiratory disease, pneumonia), arenaviruses 
(lymphocytic choriomeningitis), bunyavimses (Hantavirus), coronaviruses (pneumonia, chronic 
bronchitis), hepadnaviruses (hepatitis), herpesviruses (herpes simplex virus, varicella-zoster virus, 

35 Epstein-Barr virus, cytomegalovirus), flaviviruses (yellow fever), orthomyxoviruses (influenza), 

51 



WO 01/44448 



PCT/US00/33158 



papillomaviruses (cancer), paramyxoviruses (measles, mumps), picornoviruses (rhinovirus, 
poliovirus, coxsackie- virus), polyomaviruses (BK virus, JC virus), poxviruses (smallpox), reo virus 
(Colorado tick fever), retroviruses (human immunodeficiency vims, human T lymphotropic virus), 
rhabdoviruses (rabies), rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). The 

5 polynucleotide sequences encoding ORP may be used in Southern or northern analysis, dot blot, or 
other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered ORP expression. 
Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding ORP may be useful in assays that 

10 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding ORP may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 

15 sample then the presence of altered levels of nucleotide sequences encoding ORP in the sample indicates 
the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment 
of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of ORP, a 
20 normal or standard profile for expression is established This may be accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding ORP, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects with values 
from an experiment in which a known amount of a substantially purified polynucleotide is used. 
25 Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
30 patient begins to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
35 of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
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clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding ORP may 

5 involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or 
produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding ORP, or 
a fragment of a polynucleotide complementary to the polynucleotide encoding ORP, and will be 
employed under optimized conditions for identification of a specific gene or conditioa Oligomers may 
also be employed undo- less stringent conditions for detection or quantification of closely related DNA 

10 or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding ORP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, 
insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans. 
Methods of SNP detection include, but arc not limited to, single-stranded conformation polymorphism 

15 (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from the 
polynucleotide sequences encoding ORP are used to amplify DNA using the polymerase chain reaction 
(PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, bodily 
fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR 
products in single-stranded form, and these differences are detectable using gel electrophoresis in non- 
20 denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows 
detection of the amplimers in high-throughput equipment such as DNA sequencing machines. 
Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable of 
identifying polymorphisms by comparing the sequence of individual ova-lapping DNA fragments which 
assemble into a common consensus sequence. These compute-based methods filter out sequence 

25 variations due to laboratory preparation of DNA and sequencing errors using statistical models and 
automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASS ARRAY system 
(Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of ORP include radiolabding or 

30 biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 

35 quantitatioa 
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In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described in Seilhamer, J J. et al., "Comparative Gene Transcript 
5 Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 

10 to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, antibodies specific for ORP, or ORP or fragments thereof may be used 
as dements on a microarray. The microarray may be used to monitor or measure protein-protein 

15 interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 

20 given time. (See Seilhamer et a!., "Comparative Gene Transcript Analysis/' U.S. Patent Number 
5,840,484, expressly incorporated by reference hereia) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 

25 invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

30 Transcript images which profile the expression of the polynucleotides of the present invention 

may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently tamed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 

35 (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Sleiner, S. and N.L. Anderson (2000) 
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Toxicol. Lett. 112-11 3:467-471 , expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 
5 provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
10 necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at httpy/www.nidis.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

15 In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 

containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 

20 levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 

25 can be subjected individually to further analysis. Proteome expression pattens, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 

30 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 

35 positioned proton spots from different samples, for example, from biological samples either treated or 
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untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 

5 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for ORP to quantify the 
levels of ORP expression. In one embodiment, the antibodies are used as elements on a microarray, and 
protein expression levels are quantified by exposing the microarray to the sample and detecting the 

10 levels of proton bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element 

Toxicant signatures at the proteome level are also useful for toxicologic^ screening, and should 

15 be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 

20 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 

25 in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 

30 with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample* 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

35 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
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USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Hdler, R.A. et al. (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarrays: A Practical Approach , M. Schena, ed (1999) 
5 Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding ORP may be used to 
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 

10 of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J J. et al. (1997) Nat. 

15 Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g., 
Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 

data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding ORP on a physical map 
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA 

25 associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 

30 valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R. A et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 

35 also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
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among normal, carrier, or. affected individuals. 

In another embodiment of the invention, ORP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
5 solid support, borne on a cell surface, or located intracellular^. The formation of binding complexes 
between ORP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of differ ait small test compounds are 
10 synthesized on a solid substrate. The test compounds are reacted with ORP, or fragments thereof, and 
washed Bound ORP is then detected by methods well known in the art. Purified ORP can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
1 5 antibodies capable of binding ORP specifically compete with a test compound for binding ORP. In this 
manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
determinants with ORP. 

In additional embodiments, the nucleotide sequences which encode ORP may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
20 properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
25 of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No. 60/172,367, are hereby expressly incorporated by reference. 

EXAMPLES 

30 I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl 
35 cushions or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
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or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
5 Chatsworth C A), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g„ the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 

10 vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 

recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5 . 1 -6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 

15 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 

chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2. 1 plasmid 
(Invitrogen, Carlsbad CA), or pINC Y plasmid (Incyte Genomics, Palo Alto C A). Recombinant 

20 plasmids were transformed into competent E. coli cells including XL1 -Blue, XL1 -BlueMRF, or SOLR 
from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least 

25 one of the following: a Magic or WIZARD Minipreps DNA puriflcation system (Promega); an AGTC 
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. A.L. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
distilled water and stored, with or without lyophilization, at 4 C. 

30 Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 

35 (Labsystems Oy, Helsinki, Finland). 
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III* Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
5 (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

MICROLAB 2200 (Hamilton) liquid transfer system cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were 

10 carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 
PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, 
supra , unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 

1 5 disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 

20 algorithms used, the second column provides brief descriptions thereof, the third column presents 

appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 

25 Engineering, South San Francisco CA) and LASER GENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEGALIGN multi sequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, and poly A sequences 

30 and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 

programing, and dinucleotide nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FAST A, and BLIMPS. The sequences were assembled into full 

35 length polynucleotide sequences using programs based on Phrcd, Phrap, and Conscd, and were screened 
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for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 

polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 

and these full length sequences were subsequently analyzed by querying against databases such as the 

GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
5 and Hidden Markov Model (HMM)-based protein family databases such as PFAM. HMM is a 

probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., 

Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
10 NO:28-54. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 

amplification technologies were described in The Invention section above. 

IV, Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
15 particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 

supra , ch. 4 and. 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
20 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

25 The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 

30 every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared A product score of 70 is produced either by 1 00% identity and 

35 70% overlap at one end, cm- by 88% identity and 100% overlap at the other. A product score of 50 is 
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produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding ORP occurred Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
5 developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 

reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
10 3. 

V. Chromosomal Mapping of ORP Encoding Polynucleotides 

The cDN A sequences which were used to assemble SEQ ID NO:28-54 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 

15 SEQ ID NO:28-54 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WTGR), and G6n6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 

20 of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:3 1 , SEQ ID NO:42, SEQ ID NO:48, and SEQ ID 
NO:53 are described in The Invention as ranges, or intervals, of human chromosomes. More than one 
map location is reported for SEQ ID NO:48, indicating that previously mapped sequences having 
similarity, but not complete identity, to SEQ ID NO:48 were assembled into their respective clusters. 

25 The map position of an interval, in centiMorgans, is measured relative to the terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 

30 radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web 
site (http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 

VI. Extension of ORP Encoding Polynucleotides 

35 The full length nucleic acid sequences of SEQ ID NO:28-54 were produced by extension of an 
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appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment One prima* was synthesized to initiate 5' extension of the known fragment, and the otho" 
primer, to initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
5 in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 

temperatures of about 68 C to about 72 C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDN A libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed 

10 High fidelity amplification was obtained by PCR using methods well known in the art. PCR 

was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfix DNA polymerase (Stratagene), with the following parameters for prima: 

15 pair PCI A and PCI B: Step 1 : 94 C, 3 min; Step 2: 94 C, 15 sec; Step 3: 60 C, 1 min; Step 4: 68 C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 C, 5 min; Step 7: storage at 4 C. In the 
alternative, the parameters for prima- pair T7 and SK+ were as follows: Step 1 : 94 C, 3 min; Step 2: 
94 C, 15 sec; Step 3: 57 C, 1 min; Step 4: 68 C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 C, 5 min; Step 7: storage at 4 C. 

20 The concentration of DNA in each well was determined by dispensing 100 \xl P1COGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 jal of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fiuoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

25 concentration of DNA. A 5 1 to 10 1 aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384- well plates, 
digested with CviJT cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 

30 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 

35 media, and individual colonies were picked and cultured overnight at 37 C in 384-well plates in LB/2x 
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carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 
94 C, 3 min; Step 2: 94 C, 15 sec; Step 3: 60 C, 1 min; Step 4: 72 C, 2 min; Step 5: stops 2, 3, and 4 
5 repeated 29 times; Step 6: 72 C, 5 min; Step 7: storage at 4 C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 

10 sequencing ready reaction kit (Applied Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:28-54 are used to obtain 5* 
regulatory sequences using the procedure above* along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 

15 Hybridization probes derived from SEQ ID NO:28-54 are employed to screen cDNAs, genomic 

DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 Ci of [y-^P] adenosine 

20 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 

25 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

30 Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared 

VIIL Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweila-, supra) , mechanical 
35 microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 
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technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
5 produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of dements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

10 comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 

15 fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 

20 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mcr), IX first 
strand buffer, 0.03 units/pl RNase inhibitor, 500 \M dATP, 500 joM dGTP, 500 \M dTTP, 40 pM 

25 dCTP, 40 nM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 

30 incubated for 20 minutes at 85 C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a Speed VAC (Savant Instruments Inc., Holbrook NY) and 

35 resuspended in 14 fxL 5X SSC/0.2% SDS. 
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Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDN A inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array dements are 
5 amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
jig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
10 washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 

Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
15 Patent No. 5,807,522, incoiporated herein by reference. 1 pi of the array element DNA, at an average 

concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 

apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
20 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 

buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 C followed by washes in 

0.2% SDS and distiDed water as before. 

Hybridization 

Hybridization reactions contain 9 nl of sample mixture consisting of 0.2 [xg each of Cy3 and 
25 Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

mixture is heated to 65 C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 nl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
30 about 6.5 hours at 60 C. The arrays are washed for 10 min at 45 C in a first wash buffer (IX SSC, 
0.1 % SDS), three times for 10 minutes each at 45 C in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
35 at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
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focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1 .8 cm x 1,8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
5 In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 

10 typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

1 5 location to be correlated with a weight ratio of hybridizing species of 1 : 1 00,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

20 The output of the photomultiplier tube is digitized using a 1 2-bit RTI-835H analog-to-digital 

(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 

25 measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore* s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 

30 for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
IX. Complementary Polynucleotides 

Sequences complementary to the ORP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring ORP. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 

35 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
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4.06 software (National Biosciences) and the coding sequence of ORP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5 ' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the ORP-encoding transcript 

5 X. Expression of ORP 

Expression and purification of ORP is achieved using bacterial or virus-based expression 
systems. For expression of ORP in bacteria, cDNA is subcloned into an appropriate vector containing 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcription. 
Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the 

10 T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory element. 

Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). Antibiotic 
resistant bacteria express ORP upon induction with isopropyl beta-D-thiogalactopyranoside (1PTG). 
Expression of ORP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with 
recombinant Autoeraphica californica nuclear polyhedrosis virus (AcMNPV), commonly known as 

1 5 baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA encoding ORP by 
either homologous recombination or bacterial-mediated transposition involving transfer plasmid 
intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of 
cDNA transcriptioa Recombinant baculovirus is used to infect Snodoptera frueiperda (Sf9) insect 
cells in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 

20 genetic modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 7:1937-1945.) 

In most expression systems, ORP is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 

25 enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from ORP at 
specifically engineered sites. FLAG, an 8 -amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 

30 His, a stretch of six consecutive histidine residues, enables purification on metal-chdate resins 

(QIAGEN). Methods for protein expression and purification are discussed in Ausubd (1995, supra , 
ch. 10 and 1 6). Purified ORP obtained by these methods can be used directly in the assays shown in 
Examples XI and XV. 
XL Demonstration of ORP Activity 

35 For purposes of example, an assay demonstrating the activity of a short-chain alcohol 
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dehydrogenase is described. Essentially the same method is used for other types of oxidoreductases, 
with suitable substitution of the substrate and electron acceptor. ORP activity is demonstrated by the 
oxidation of NADPH to NADP in the presence of substrate (Kunau and Dommes (1978) Eur. J. 
Biochem. 91 :533-544). Substrates include, but arc not limited to, all-trans-retinaldehyde and cisA- 
5 dienoyl-CoA. ORP is preincubated for 10 minutes at 37 C in 60 pM potassium phosphate (pH 7.4), 
125 nM NADPH, and 0.2 pM CoA (coenzyme A). The reaction is initiated by addition of the 
appropriate substrate (12.5 to 150 pM final concentration). The change in absorbance of the reaction 
at 340 nm, due to the oxidation of NADPH to NADP, is measured using a spectrophotometer at 23 
°C Units of ORP activity are expressed as pinoles of NADP formed per minute. A reaction lacking 

1 0 ORP is used as a negative control . 

Alternatively, ORP activity is assayed by measuring the reduction of insulin. Aliquots of 
ORP are preincubated at 37 °C for 20 min with 2 pi of 50 mM Hepes, pH 7.6, 100 pg/ml bovine 
serum albumin, and 2 mM DTT in a total volume of 70 pi. Then, 40 pi of a reaction mixture 
composed of 200 pi of Hepes (1 M), pH 7.6, 40 pi of EDTA (0.2 M), 40 pi of NADPH (40 mg/ml), 

15 and 500 pi of insulin (10 mg/ml) is added. The reaction is initiated with the addition of 10 pi of 

thioredoxin reductase from calf thymus (3.0 A412 unit), and incubation is continued for 20 min at 37 
°C. The reaction rate is followed by monitoring the oxidation of NADPH at 412 nM. The oxidation 
of NADPH is proportional to the amount of insulin reduction. 
XII. Functional Assays 

20 ORP function is assessed by expressing the sequences encoding ORP at physiologically 

elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 g of recombinant vector are transiently transfected into a 

25 human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 

formulations or electroporatioa 1-2 g of an additional plasmid containing sequences encoding a 
marker protein are co- transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Mar to- proteins of choice include, e.g., Green Fluorescent Protein (GFP; 

30 Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in 

35 cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
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regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 

5 ( 1 994) Flow Cytometry , Oxford, New York NY. 

The influence of ORP on gene expression can be assessed using highly purified populations of 
cells transfected with sequences encoding ORP and either CD64 or CD64-GFP. CD64 and CD64-GFP 
are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 

10 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRNA can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding ORP and other genes of interest can be analyzed by northern analysis or microarray 
techniques. 

XIII. Production of ORP Specific Antibodies 

15 ORP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the ORP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 

20 synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- Aldrich, 

25 St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubd, 1995, supra .) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant Resulting antisera are tested for antipeptide and anti-ORP 
activity by, for example, binding the peptide or ORP to a substrate, blocking with 1% BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

30 XIV. Purification of Naturally Occurring ORP Using Specific Antibodies 

Naturally occurring or recombinant ORP is substantially purified by immunoaffinity 
chromatography using antibodies specific for ORP. An immunoaffinity column is constructed by 
covalently coupling anti-ORP antibody to an activated chromatographic resin, such as CNBr-activated 
SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed 

35 according to the manufacturer's instructions. 
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Media containing ORP are passed over the immunoaffinity column, and the column is washed 
undo* conditions that allow the preferential absorbance of ORP (e.g., high ionic strength buffers in the 
presence of detergent). The column is eluted under conditions that disrupt antibody/ORP binding (e.g., 
a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and 
5 ORP is collected 

XV. Identification of Molecules Which Interact with ORP 

ORP, or biologically active fragments thereof, are labeled with l25 I Bolton-Hunter reagent. 
(See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled ORP, washed, and 
10 any wells with labeled ORP complex are assayed. Data obtained using different concentrations of ORP 
are used to calculate values for the number, affinity, and association of ORP with the candidate 
molecules. 

Alternatively, molecules interacting with ORP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
15 available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

ORP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

20 

Various modifications and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
25 Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 
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1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-27. 

2. An isolated polypeptide of claim 1 selected from the group consisti ng of SEQ ID NO: 1 - 

27. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:28-54. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promote* sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

b) recovering the polypeptide so expressed. 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1 . 

1 1. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

5 a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 28-54, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
10 e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1 . 

15 1 3. A method for detecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 

20 complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 1 3, wherein the probe comprises at least 60 contiguous nucleotides. 

25 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

30 b) detecting the presence or absence of said amplified target polynucleotide or fragment 

thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutical^ acceptable excipienL 

35 
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17. A composition of claim 16, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO; 1-27. 

1 8. A method for treating a disease or condition associated with decreased expression of 

5 functional ORP, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

1 9. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutically acceptable excipient. 

15 

21 . A method for treating a disease or condition associated with decreased expression of 
functional ORP, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
ORP, comprising administering to a patient in need of such treatment a composition of claim 23. 

30 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

35 b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
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compound that specifically binds to the polypeptide of claim 1 . 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, undo- conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying amounts of 
the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 

a) treating a biological sample containing nucleic acids with the test compound; 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 
least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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Trp 


Asp 


GlU 


Asp 


Glu 


Gin 


Trp 


Ser Val Val 


Val 








305 










310 






315 


Glu 


Cys 


GlU Asp 


Cys 


Glu 


Leu 


He 


Pro 


Ala 


Asp 


His Val He 


Val 






320 










325 






330 


Thr 


Val 


Ser Leu 


Gly 


Val 


Leu 


Lys 


Arg 


Gin 


Tyr 


Thr Ser Phe 


Phe 








335 










340 






345 


Arg 


Pro 


Gly Leu 


Pro 


Thr 


Glu 


Lys 


Val 


Ala 


Ala 


He His Arg 


Leu 




350 










355 






360 


Gly 


He 


Gly Thr 


Thr 


Asp 


Lys 


He 


Phe 


Leu 


Glu 


Phe Glu Glu 


Pro 






365 










370 






375 


Phe 


Trp 


Gly Pro 


Glu 


Cys 


Asn 


Ser 


Leu 


Gin 


Phe 


Val Trp Glu 


Asp 








380 










385 






390 


Glu 


Ala 


Glu Ser 


His 


Thr 


Leu 


Thr 


Tyr 


Pro 


Pro 


Glu Leu Trp 


Tyr 








395 










400 






405 


Arg 


Lys 


He Cys 


Gly 


Phe 


Asp 


Val 


Leu 


Tyr 


Pro 


Pro Glu Arg 


Tyr 








410 










415 






420 


Gly 


His 


Val Leu 


Ser 


Gly 


Trp 


He 


Cys 


Gly 


Glu 


Glu Ala Leu 


Val 








425 










430 






435 


Met 


Glu 


Lys Cys 


Asp 


Asp 


Glu 


Ala 


Val 


Ala 


Glu 


He Cys Thr 


Glu 








440 










445 






450 


Met 


Leu 


Arg Gin 


Phe 


Thr 


Gly 


Asn 


Pro 


Asn 


He 


Pro Lys Pro 


Arg 








455 










460 






465 


Arg 


He 


Leu Arg 


Ser 


Ala 


Trp 


Gly 


Ser 


Asn 


Pro 


Tyr Phe Arg 


Gly 








470 










475 






480 


Ser 


Tyr 


Ser Tyr 


Thr 


Gin 


Val 


Gly 


Ser 


Ser 


Gly 


Ala Asp Val 


Glu 








485 










490 






495 


Lys 


Leu 


Ala Lys 


Pro 


Leu 


Pro 


Tyr 


Thr 


Glu 


Ser 


Ser Lys Thr 


Ala 








500 










505 






510 


Pro 


Met 


Gin Val 


Leu 


Phe 


Ser 


Gly 


Glu 


Ala 


Thr 


His Arg Lys 


Tyr 








515 










520 






525 


Tyr 


Ser 


Thr Thr 


His 


Gly 


Ala 


Leu 


Leu 


Ser 


Gly 


Gin Arg Glu 


Ala 








530 










535 






540 


Ala 


Arg 


Leu He 


Glu 


Met 


Tyr 


Arg 


Asp 


Leu 


Phe 


Gin Gin Gly 


Thr 








545 










550 






555 
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<211> 337 

<212> PRT 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 13 02741CD1 

<400> 4 



Met 


Ala 


Leu 


Gin 


Thr 


Leu 


Gin 


Ser 


Ser Trp 


Val 


Thr Phe Arg 


Lys 


1 








5 








10 




15 


He 


Leu 


Ser 


His 


Phe 


Pro 


Glu 


Glu 


Leu Ser 


Leu 


Ala Phe Val 


Tyr 










20 








25 






30 


Gly 


Ser 


Gly 


Val 


Tyr 


Arg 


Gin 


Ala 


Gly Pro 


Ser 


Ser Asp Gin 


Lys 










35 








40 






45 


Asn 


Ala 


Met 


Leu 


Asp 


Phe 


Val 


Phe 


Thr Val 


Asp 


Asp Pro Val 


Ala 










50 








55 




60 


Trp 


His 


Ser 


Lys 


Asn 


Leu 


Lys 


Lys 


Asn Trp 


Ser 


His Tyr Ser 


Phe 










65 








70 






75 


Leu 


Lys 


Val 


Leu 


Gly 


Pro 


Lys 


lie 


He Thr 


Ser 


He Gin Asn 


Asn 










80 






85 






90 


Tyr 


Gly 


Ala 


Gly 


Val 


Tyr 


Tyr 


Asn 


Ser Leu 


lie 


Met Cys Asn 


Gly 










95 








100 






105 


Arg 


Leu 


He 


Lys 


Tyr 


Gly 


Val 


He 


Ser Thr 


Asn 


Val Leu He 


Glu 










110 








115 






120 


Asp 


Leu 


Leu 


Asn 


Trp 


Asn 


Asn 


Leu 


Tyr He 


Ala 


Gly Arg Leu 


Gin 
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125 



Lys 


Pro 


vai Lys 


lie ne ber vai Asn 








1 A ft 


ser 


Aia 


Lieu ASp 


Arg Asn Leu Lys ber 








Leu. 


Mec 


Leu Pro 


uiu ber rne ber biu 








Ma 

J. / u 


-Lie 


Aia 


Gly Leu 


ber lyr ber Giy Asp 






1 ct; 

lOD 


giu 


Asp 


Lys Thr 


jjys vai Lieu Asn lie 








9 ftft 


HIS 


pne 


Arg Glu 


Leu ryr Giy ser lie 










vai 


Val 


Tyr Lys 


ber Gin Gin Giy irp 








9*3 ft 


Pro 


G1U 


Gly Gin 


pne Tnr Gin Leu Met 








Gin 


Gin 


Gin lie 


Asn His lie Met Asp 








o c n 


Asp 


vai 


GlU G1U 


Tnr Leu pne Gin vai 






275 


Gly 


Asp 


Val Val 


Arg Leu Gly Leu ser 








290 


Ser 


He 


Arg Gin 


Ser Thr Lys Gly lie 








305 


Lys 


Ser 


Val He 


Tyr Ser Ser Leu Lys 








320 


Gly 


Trp 


Leu Arg 


Lys Thr Ser 



335 



<210> 5 
<211> 109 
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<220> 
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<223> incyte ID No: 1541028CD1 



<400> 5 








Met 


Ser 


Ala 


Asn 


Thr Phe Gly Asn Ala 


1 








5 


Pro 


Gly 


Ala 


Arg 


Leu Glu Gly Arg Cys 










20 


Val 


Arg 


Cys 


His 


Leu Gly Leu Lys He 










35 


Val 


Val 


Gly 


Gly 


Glu Pro Gin Cys Trp 










50 


Leu 


Val 


Asp 


Asp 


Ser Phe Leu His Thr 










65 


Pro 


Glu 


Asp 


Gly 


Pro Arg Val Val Phe 










80 


Pro 


Asn 


Val 


Ala 


Gly Ala Glu Arg Gin 










95 


Ala 


Pro 


Asp 


Pro 
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130 135 



Glu 


Asp 


vai mr 


Leu 


Arg 


14b 








IbU 


Ala 

Aia 


vai 


inr Aia 


Aia 


pne 


lOU 








ICC 

lob 


LrlU 


Asp 


LeU rne 


lie 


GlU 


1TC 

1/3 






lo u 


pne 


Arg 


Men vai 


vai 


Giy 


1 y U 








13b 


vai 


Lys 


Pro Asn 


Ti- 
ne 


Ala 


9 








91 ft 
Z J. u 


Leu 


ply, 

Gin 


Glu Asn 


Pro 


Gin 










Z A D 


Leu 


GlU 


He Asp 


Lys 


Ser 


Z J D 








Z4 U 


Thr 


Leu 


Pro Lys 


Thr 


Leu 


O CO 

zbU 






zbb 


Pro 


Pro 


Gly Lys 


Asn 


Arg 


^ u j 








270 


Ala 


His 


Asp Pro 


Asp 


Cys 


280 








285 


Ala 


He 


Val Arg 


Pro 


Ser 


295 








300 


Phe 


Thr 


Ala Gly 


Leu 


Lys 


310 








315 


Leu 


His 


Lys Met 


Trp 


Lys 



325 330 



Gly 


Phe 


Ser Val 


Leu 


Leu 


10 








15 


Gly 


Pro 


Thr Asn 


Ala 


Arg 


25 








30 


Pro 


Pro 


Gly Cys 


Glu 


Leu 


40 








45 


Ala 


Glu 


Gly His 


Cys 


Leu 


55 








60 


Val 


Ala 


His Asn 


Gly 


Ser 


70 






75 


He 


Val 


Asp Leu 


Trp 


His 


85 






90 


Ala 


Leu 


Asp Phe 


Val 


Phe 



100 105 
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<211> 385 
<212> PRT 

<213> Homo sapiens 
<220> 
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<223> Incyte ID No: 



<400> 6 






Met 


Lys 


Met Leu 


Pro 


1 




5 


Ala 


Arg 


Val Leu 


Val 






20 


Glu 


Ala 


Leu Trp 


Gly 






35 


Glu 


Glu 


Met Asn 


He 








50 


Leu 


Leu 


His Gin 


Asp 








65 


Pro 


Leu 


Thr Arg 


Gin 








80 


Asn 


Val 


Val Cys 


Glu 






95 


Met 


Val 


Thr Ala 


Ser 








110 


Gly 


Asn 


Val Leu 


Arg 








125 


Leu 


He 


Ser Glu 


His 








140 


Arg 


He 


Tyr Ser 


Gly 








155 


Cys 


Asp 


Glu Leu 


Met 








170 


Tyr 


He 


Val Asp 


Leu 








185 


Lys 


Val 


His Gly 


Leu 








200 


He 


Arg 


Gly He 


Arg 








215 


Gin 


Met 


Leu Met 


Gly 








230 


Phe 


Asn 


Met Pro 


Gly 








245 


Ser 


Ala 


Gly Arg 


Leu 








260 


Lys 


Asn 


Ser Ala 


Thr 








27 5 


Ala 


Val 


Gly Ala 


Gly 








290 


Leu 


Leu 


Tyr Leu 


Lys 








305 


Gin 


Ser 


Phe Gin 


Gly 








320 


Pro 


Val 


Ser Met 


Ala 








335 


Ser 


Val 


Val Asp 


Ala 






350 


Glu 


Ala 


Val Glu 


Val 








365 


Leu 


Cys 


Glu Ala 


Leu 








380 



1597687CD1 



Gly 


Val 


Gly 


Val Pne 






10 


Pro 


Leu 


Leu 


Arg Ala 
25 


Lys 


Thr 


Glu 


GlU GlU 






40 


Ala 


Phe 


Tyr 


Thr Ser 






55 


Val 


Asp 


Leu 


Val Cys 
70 


He 


Ser 


Val 


Lys Ala 
85 


Lys 


Ala 


Ala 


Thr Ser 
100 


Arg 


Tyr 


Tyr 


Pro Gin 
115 


Phe 


Leu 


Pro 


Ala Phe 
130 


Tyr 


Val 


Gly 


Ala Val 
145 


Ser 


Leu 


Leu 


Ser Pro 
160 


Gly 


Gly 


Gly 


Gly Leu 
175 


Leu 


Thr 


His 


Leu Thr 
190 


Leu 


Lys 


Thr 


Phe Val 






205 


His 


Val 


Thr 


Ser Asp 
220 


Gly 


Gly 


Val 


Cys Ser 
235 


Ala 


Phe 


Val 


His Glu 
250 


Val 


Ala 


Arg 


Gly Ala 
265 


Gin 


Glu 


Glu 


Leu Leu 
280 


Leu 


Pro 


Glu 


Gin Gly 
295 


Gly 


Met 


Val 


Tyr Met 
310 


Gin 


Gly 


Asp 


Arg Arg 
325 


Ala 


Ser 


Phe 


Glu Asp 
340 


He 


Lys 


Arg 


Ser Ser 
355 


Leu 


Thr 


Glu 


Glu Pro 
370 


Gin 


Arg 


Asn 


Asn Leu 
385 



Gly 


Thr 


Gly 


Ser 


Ser 
15 


Glu 


Gly 


Phe 


Thr 


Val 
30 


Ala 


Lys 


Gin 


Leu 


Ala 
45 


Arg 


Thr 


Asp 


Asp 


He 
60 


He 


Ser 


He 


Pro 


Pro 
75 


Leu 


Gly 


lie 


Gly 


Lys 
90 


Val 


Asp 


Ala 


Phe 


Arg 
105 


Leu 


Met 


Ser 


Leu 


Val 
120 


Val 


Arg 


Met 


Lys 


Gin 
135 


Met 


He 


Cys 


Asp 


Ala 
150 


Ser 


Tyr 


Gly 


Trp 


He 
165 


His 


Thr 


Met 


Gly 


Thr 
180 


Gly 


Arg 


Arg 


Ala 


Glu 
195 


Arg 


Gin 


Asn 


Ala 


Ala 
210 


Asp 


Phe 


Cys 


Phe 


Phe 
225 


Thr 


Val 


Thr 


Leu 


Asn 
240 


Val 


Met 


Val 


Val 


Gly 
255 


Asp 


Leu 


Tyr 


Gly 


Gin 
270 


Leu 


Arg 


Asp 


Ser 


Leu 
285 


Pro 


Gin 


Asp 


Val 


Pro 
300 


Val 


Gin 


Ala 


Leu 


Arg 
315 


Thr 


Trp 


Asp 


Arg 


Thr 
330 


Gly 


Leu 


Tyr 


Met 


Gin 
345 


Arg 


Ser 


Gly 


Glu 


Trp 
360 


Asp 


Thr 


Asn 


Gin 


Asn 
375 
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<400> 7 



Met 


Ala 


Cys Ala 


Ala 


•Val 


Met 


1 




5 






Val 


Gly 


Ala He 


Arg 


He 


Glu 






20 






Ser 


Thr 


Leu Arg 


His 


Leu 


Thr 






35 






Arg 


Lys 


Thr Asp 


His 


Met 


Glu 








50 






Glu 


lie 


Val Ala 


Ala 


Ala 


Lys 








65 






Pro 


Ser 


Val Lys 


Ser 


Leu 


Arg 








80 






Ser 


Phe 


Lys Ala 


Gly 


Gin 


Trp 








95 






Ser 


Val 


Val Gly 


Gly 


Phe 


ser 








110 






GlU 


Gin 


Glu Arg 


Val 


He 


Glu 






125 






Pro 


Pro 


Ala Leu 


Trp 


Val 


His 








140 






Val 


Ala 


Val Arg 


Val 


Gly 


Gly 








155 






Ala 


Asp 


Ala Ser 


Arg 


Asn 


Leu 






170 






lie 


Asn 


Pro Leu 


Leu 


Ser 


He 








185 






Arg 


Glu 


Gin Ala 


Asn 


Lys 


Arg 








200 






Lys 


Leu 


Phe Tyr 


Ser 


Ala 


Lys 








215 






Lys 


Asn 


He Leu 


Asp 


Leu 


Val 








230 






Cys 


Ser 


Leu His 


Val 


Thr 


Lys 








245 






Leu 


Lys 


Pro Tyr 


He 


Thr 


Glu 






260 






Arg 


Asp 


His He 


Ser 


Lys 


Glu 








275 






Pro 


Pro 


Met Thr 


Asp 


Phe 


Phe 








290 






Val 


Pro 


Lys Glu 


His 


He 


Cys 



305 



He Pro Gly Leu Leu Arg Cys 


Ser 




10 




15 


Ala Ala 


Ser 


Leu Arg Leu Thr 


Leu 




25 




30 


Leu Thr 


Ser 


He Met Lys Ser 


Lys 




40 




45 


Arg Thr Ala Ser Val Leu Arg 


Arg 




55 




60 


Val Cys Gly Ala Ala Ser Glu 


Ser 




70 




75 


Leu Leu 


Val 


Ala Asp Gin Asp 


Phe 




85 




90 


Val Asp 


Phe 


Phe He Pro Gly 


Val 




100 




105 


He Cys 


Ser 


Ser Pro Arg Leu 


Leu 




115 




120 


Leu Ala 


Val 


Lys Tyr Thr Asn 


His 




130 




135 


Asn Thr 


Cys 


Thr Leu Asp Cys 


Glu 




145 




150 


Glu Phe 


Phe 


Phe Asp Pro Gin 


Pro 




160 




165 


Val Leu 


He Ala Gly Gly Val 


Gly 




175 




180 


Leu Arg 


His 


Ala Ala Asp Leu 


Leu 




190 




195 


Asn Gly Tyr Glu He Gly Thr 


He 




205 




210 


Asn Thr 


Ser 


Glu Leu Leu Phe 


Lys 




220 




225 


Asn Glu 


Phe 


Pro Glu Lys He 


Ala 




235 




240 


Gin Thr 


Thr 


Gin He Asn Ala 


Glu 




250 




255 


Gly Arg 


He 


Thr Glu Lys Glu 


He 




265 




270 


Thr Leu 


Phe 


Tyr He Cys Gly 


Pro 




280 




285 


Ser Lys 


Gin 


Leu Glu Asn Asn 


His 


295 




300 


Phe Glu 


Lys 


Trp Trp 






310 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 1865603CD1 

<400> 8 



Met 


Leu 


Arg His 


Leu 


Pro Ser Arg Leu Pro 


Val 


Lys Met Trp 


Gly 


1 








5 


10 






15 


Arg 


Thr 


Leu 


Glu 


Lys 


Gin Ser Trp Arg Asp 


Ser 


Ser Gin Thr 


Pro 










20 


25 






30 


Pro 


Pro 


Cys 


Leu 


He 


Arg Arg Leu Asp His 


He 


Val Met Thr 


Val 










35 


40 






45 


Lys 


Ser 


He 


Lys 


Asp 


Thr Thr Met Phe Tyr 


Ser 


Lys He Leu 


Gly 










50 


55 






60 


Met 


Glu 


Val 


Met 


Thr 


Phe Lys Glu Asp Arg Lys Ala Leu Cys 


Phe 










65 


70 






75 
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Gly Asp 


Gin Lys 


Phe 


Asn 


Leu 


His Glu Val Gly Lys Glu Phe Glu 








80 








85 




90 


Pro 


Lys 


Ala Ala 


His 


Pro 


Val 


Pro Gly 


Ser 


Leu Asp 


He Cys Leu 








95 








100 




105 


lie 


Thr 


Glu Val 


Pro 


Leu 


Glu 


Glu Met 


He 


Gin His 


Leu Lys Ala 








110 








115 




120 


cys 


Asp 


Val Pro 


lie 


Glu 


GlU 


Gly Pro 


Val 


Pro Arg 


Thr Gly Ala 




125 








130 




135 


Lys 


Gly 


Pro lie 


Met 


Ser 


He 


Tyr Phe 


Arg 


Asp Pro 


Asp Arg Asn 








140 








145 




150 


Leu 


lie 


Glu Val 


Ser Asn Tyr 


He Ser 


Ser 












155 








160 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 1976472CD1 

<400> 9 



Met 


Asp 


Trp 


He 


Arg 


Leu 


Thr 


Lys Ser Gly Lys Asp Leu Thr 


Gly 


1 








5 






10 


15 


Leu 


Lys 


Gly 


Arg 


Leu 


He 


Glu 


Val Thr Glu Glu Glu Leu Lys 


Lys 








20 






25 


30 


His 


Asn 


Lys 


Lys 


Asp Asp 


Cys 


Trp He Cys He Arg Gly Phe Val 










35 






40 


45 


Tyr Asn 


Val 


Ser 


Pro 


Tyr 


Met 


Glu Tyr His Pro Gly Gly Glu 


Asp 










50 






55 


60 


Glu 


Leu 


Met 


Arg 


Ala 


Ala 


Gly 


Ser Asp Gly Thr Glu Leu Phe Asp 










65 






70 


75 


Gin 


Val 


His 


Arg 


Trp 


Val 


Asn 


Tyr Glu Ser Met Leu Lys Glu 


Cys 










80 






85 


90 


Leu 


Val 


Gly 


Arg 


Met 


Ala 


He 


Lys Pro Ala Val Leu Lys Asp 


Tyr 










95 






100 


105 


Arg Glu 


Glu 


Glu 


Lys 


Lys 


Val 


Leu Asn Gly Met Leu Pro Lys 


Ser 










110 






115 


120 


Gin 


Val 


Thr 


Asp 


Thr 


Leu 


Ala 


Lys Glu Gly Pro Ser Tyr Pro 


Ser 








125 






130 


135 


Tyr 


Asp 


Trp 


Phe 


Gin 


Thr 


Asp 


Ser Leu Val Thr He Ala He 


Tyr 










140 






145 


150 


Thr 


Lys 


Gin 


Lys 


Asp 


He 


Asn 


Leu Asp Ser He He Val Asp 


His 










155 






160 


165 


Gin 


Asn 


Asp 


Ser 


Phe 


Arg 


Ala 


Glu Thr He He Lys Asp Cys 


Leu 








170 






175 


180 


Tyr 


Leu 


He 


His 


He Gly 


Leu 


Ser His Glu Val Gin Glu Asp Phe 








185 






190 


195 


Ser 


Val 


Arg 


Val 


Val 


Glu 


Ser 


Val Gly Lys He Glu He Val 


Leu 








200 






205 


210 


Gin Lys 


Lys 


Glu 


Asn 


Thr 


Ser 


Trp Asp Phe Leu Gly His Pro 


Leu 










215 






220 


225 


Lys 


Asn 


His 


Asn 


Ser 


Leu 


He 


Pro Arg Lys Asp Thr Gly Leu 


Tyr 










230 






235 


240 


Tyr Arg 


Lys 


Cys 


Gin 


Leu 


He 


Ser Lys Glu Asp Val Thr His 


Asp 










245 






250 


255 


Thr 


Arg 


Leu 


Phe 


Cys 


Leu 


Met 


Leu Pro Pro Ser Thr His Leu 


Gin 








260 






265 


270 


Val 


Pro 


He 


Gly 


Gin 


His 


Val 


Tyr Leu Lys Leu Pro He Thr 


Gly 








275 






280 


285 


Thr 


Glu 


He 


Val 


Lys 


Pro 


Tyr 


Thr Pro Val Ser Gly Ser Leu 


Leu 










290 






295 


300 


Ser 


Glu 


Phe 


Lys 


Glu 


Pro 


Val 


Leu Pro Asn Asn Lys Tyr He Tyr 
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305 






310 


315 


Phe 


Leu 


He Lys 


He 


Tyr 


Pro 


Thr Gly Leu Phe Thr Pro Glu 


Leu 






320 




325 


330 


Asp 


Arg 


Leu Gin 


He 


Gly 


Asp 


Phe Val Ser Val Ser Ser Pro 


Glu 








335 




340 


345 


Gly 


Asn 


Phe Lys 


He 


Ser 


Lys 


Phe Gin Glu Leu Glu Asp Leu 


Phe 






350 




355 


360 


Leu 


Leu 


Ala Ala 


Gly 


Thr 


Gly 


Phe Thr Pro Met Val Lys He 


Leu 








365 






370 


375 


Asn 


Tyr 


Ala Leu 


Thr 


Asp 


He 


Pro Ser Leu Arg Lys Val Lys 


Leu 








380 






385 


390 


Met 


Phe 


Phe Asn 


Lys 


Thr 


Glu 


Asp Asp He He Trp Arg Ser 


Gin 








395 






400 


405 


Leu 


Glu 


Lys Leu 


Ala 


Phe 


Lys 


Asp Lys Arg Leu Asp Val Glu 


Phe 








410 






415 


420 


Val 


Leu 


Ser Ala 


Pro 


He 


Ser 


Glu Trp Asn Gly Lys Gin Gly 


His 








425 






430 


435 


lie 


ser 


Pro Ala 


Leu 


Leu 


Ser 


Glu Phe Leu Lys Arg Asn Leu 


Asp 








440 






445 


450 


Lys 


Ser 


Lys Val 


Leu 


Val 


Cys 


He Cys Gly Pro Val Pro Phe 


Thr 








455 






460 


465 


Glu 


Gin 


Gly Val 


Arg 


Leu 


Leu 


His Asp Leu Asn Phe Ser Lys 


Asn 






470 






475 


480 


Glu 


He 


His Ser 


Phe 


Thr 


Ala 












485 
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<223> Incyte ID No: 2050821CD1 

<400> 10 



Met 


Ser 


Leu 


Leu 


Ser 


Leu 


Pro 


Trp Leu Gly Leu Arg Pro Val 


Ala 


1 








5 








10 


15 


Met 


ser 


Pro 


Trp 


Leu 


Leu 


Leu 


Leu Leu Val Val Gly Ser Trp 


Leu 










20 








25 


30 


Leu 


Ala 


Arg 


He 


Leu 


Ala 


Trp 


Thr Tyr Ala Phe Tyr Asn Asn 


Cys 










35 








40 


45 


Arg 


Arg 


Leu 


Gin 


Cys 


Phe 


Pro 


Gin Pro 


Pro Lys Arg Asn Trp 


Phe 










50 








55 


60 


Trp 


Gly 


His 


Leu 


Gly 


Leu 


He 


Thr Pro 


Thr Glu Glu Gly Leu 


Lys 










65 








70 


75 


Asp 


Ser 


Thr 


Gin 


Met 


Ser 


Ala 


Thr Tyr 


Ser Gin Gly Phe Thr 


Val 










80 








85 


90 


Trp 


Leu 


Gly 


Pro 


He 


He 


Pro 


Phe He 


Val Leu Cys His Pro 


Asp 










95 








100 


105 


Thr 


He 


Arg 


Ser 


He 


Thr 


Asn 


Ala Ser 


Ala Ala He Ala Pro 


Lys 










110 








115 


120 


Asp 


Asn 


Leu 


Phe 


He 


Arg 


Phe 


Leu Lys 


Pro Trp Leu Gly Glu 


Gly 










125 








130 


135 


He 


Leu 


Leu 


Ser 


Gly 


Gly 


Asp 


Lys Trp 


Ser Arg His Arg Arg 


Met 










140 








145 


150 


Leu 


Thr 


Pro 


Ala 


Phe 


His 


Phe 


Asn He 


Leu Lys Ser Tyr He 


Thr 










155 








160 


165 


He 


Phe 


Asn 


Lys 


Ser 


Ala 


Asn 


He Met 


Leu Asp Lys Trp Gin 


His 










170 








175 


180 


Leu 


Ala 


Ser 


Glu 


Gly 


Ser 


Ser 


Arg Leu 


Asp Met Phe Glu His 


He 










185 








190 


195 


Ser 


Leu 


Met 


Thr 


Leu 


Asp 


Ser 


Leu Gin 


Lys Cys He Phe Ser 


Phe 










200 








205 


210 
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Asp 


ser 


His 


Cys 


Gin 


Glu 


Arg 










215 






Leu 


Glu 


Leu 


ser 


Ala 


Leu 


Val 










230 






Gin 


His 


Met Asp 


Phe 


Leu 


Tyr 










245 






Phe 


His 


Arg Ala 


Cys 


Arg 


Leu 










260 






He 


Arg 


Glu Arg 


Arg 


Arg 


Thr 










275 






Phe 


Phe 


Lys 


Asp 


Lys 


Ala 


Lys 










290 






Val 


Leu 


Leu 


Leu 


Ser 


Lys 


Asp 










305 






Glu 


Asp 


He Arg 


Ala 


Glu 


Ala 










320 






Asp 


Thr 


Thr 


Ala 


Ser 


Gly 


Leu 










335 






Arg 


His 


Pro 


Glu 


Tyr 


Gin 


GlU 










350 






Leu 


Leu 


Lys 


Asp 


Arg 


Asp 


Pro 










365 






Ala 


Gin 


Leu 


Pro 


Phe 


Leu 


Thr 










380 






Leu 


His 


Pro 


Pro 


Ala 


Pro 


Phe 










395 






He 


Val 


Leu 


Pro 


Asp 


Gly 


Arg 










410 






Leu 


He 


Asp 


He 


He 


Gly 


Val 










425 






Asp 


Pro 


Glu 


Val 


Tyr 


Asp 


Pro 










440 






Lys 


Gly 


Arg 


Ser 


Pro 


Leu 


Ala 










455 






Arg 


Asn 


Cys 


He 


Gly 


Gin 


Ala 










470 






Val 


Leu 


Ala 


Leu 


Met 


Leu 


Leu 










485 






Thr 


Glu 


Pro Arg 


Arg 


Lys 


Leu 










500 






Gly 


Leu 


Trp 


Leu 


Arg 


Val 


Glu 



515 



Pro 


Ser 


Glu 


Tyr 


He Ala Thr 


He 






220 




225 


Glu 


Lys 


Arg 


Ser 


Gin His He 


Leu 






235 






240 


Tyr 


Leu 


Ser 


His Asp Gly Arg 


Arg 






250 






255 


Val 


His 


Asp 


Phe Thr Asp Ala 


Val 






265 






270 


Leu 


Pro 


Thr 


Gin Gly He Asp 


Asp 






280 






285 


ser 


Lys 


Thr 


Leu 


Asp Phe He 


Asp 






295 






300 


Glu 


Asp 


Gly 


Lys 


Ala Leu Ser 


Asp 






310 






315 


Asp 


Thr 


Phe 


Met 


Phe Gly Gly 


His 






325 






330 


Ser 


Trp Val 


Leu Tyr Asn Leu 


Ala 






340 






345 


Arg 


Cys 


Arg 


Gin 


Glu Val Gin 


Glu 






355 






360 


Lys 


Glu 


He 


Glu 


Trp Asp Asp 


Leu 






370 






375 


Met 


Cys 


Val 


Lys 


Glu Ser Leu 


Arg 






385 






390 


He 


Ser 


Arg 


Cys 


Cys Thr Gin 


Asp 






400 






405 


Val 


He 


Pro 


Lys 


Gly He Thr 


Cys 






415 






420 


His 


His 


Asn 


Pro 


Thr Val Trp 


Pro 






430 






435 


Phe 


Arg 


Phe 


Asp 


Pro Glu Asn 


Ser 






445 






450 


Phe 


He 


Pro 


Phe 


Ser Ala Gly 


Pro 






460 




465 


Phe 


Ala 


Met 


Ala 


Glu Met Lys 


Val 






475 




480 


His 


Phe 


Arg 


Phe 


Leu Pro Asp 


His 






490 






495 


Glu 


Leu 


He 


Met Arg Ala Glu 


Gly 






505 






510 


Pro 


Leu 


Asn 


Val 


Gly Leu Gin 








520 









<210> 11 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2408443CD1 

<400> 11 



Met 


Val 


Thr 


Leu 


Tyr 


Ser 


Ser Pro 


Ser 


Cys 


Thr Ser 


Cys Arg 


Lys 


1 








5 








10 






15 


Ala 


Lys 


Gin 


Trp 


Leu 


Val 


Asp His 


Asn 


Leu 


Pro Phe 


He Glu 


Arg 










20 








25 






30 


Asn 


Leu 


Asn 


Lys 


Glu 


Pro 


Leu Arg 


Ala 


GlU 


Asp Val 


Lys Ala 


Met 










35 






40 




45 


Leu 


Arg 


Leu 


Thr 


Glu 


Asp 


Gly Thr 


Glu 


Glu 


Leu He 


Ser Thr 


Arg 










50 








55 






60 


Ser 


Lys 


He 


Phe 


ser 


GlU 


Leu Thr 


He Asp 


Leu Asp Asp Met 


Ser 










65 








70 






75 


He 


Asn 


Lys 


Leu 


He 


Asp 


Leu He 


Val 


Met 


Tyr Pro 


Ser Leu 


Leu 
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80 85 90 

Lys Arg Pro lie He Leu Asp Asp Gin Arg Met Gin He Gly Tyr 
95 100 105 

Asn Asp Asp Glu lie Arg Arg Phe Leu Pro Arg Glu Val Arg Gin 
110 115 120 

Arg Glu Leu He Arg Ala Thr Phe Lys Ala Asp Phe Ala Glu Glu 
125 130 135 

Ala Lys Asp Leu Val Val Glu Glu Gly 
140 



<210> 12 

<211> 373 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2508668CD1 

<400> 12 



Met 


Trp 


Val 


Cys 


Ser 


Thr 


Leu 


Trp Arg 


Val 


Arg Thr Pro Ala Arg 


1 








5 








10 


15 


Gin 


Trp 


Arg 


Gly 


Leu 


Leu 


Pro 


Ala Ser 


Gly 


Cys His Gly Pro Ala 










20 








25 


30 


Ala 


Ser 


Ser 


Tyr 


Ser 


Ala 


Ser 


Ala Glu 


Pro 


Ala Arg Val Arg Ala 








35 








40 


45 


Leu 


Val 


Tyr 


Gly 


His 


His 


Gly 


Asp Pro 


Ala 


Lys Val Val Glu Leu 










50 








55 


60 


Lys 


Asn 


Leu 


Glu 


Leu 


Ala 


Ala 


Val Arg 


Gly 


Ser Asp Val Arg Val 










65 








70 


75 


Lys 


Met 


Leu 


Ala 


Ala 


Pro 


He 


Asn Pro 


Ser 


Asp He Asn Met He 








80 








85 


90 


Gin 


Gly 


Asn 


Tyr 


Gly 


Leu 


Leu 


Pro Glu 


Leu 


Pro Ala Val Gly Gly 








95 








100 


105 


Asn 


Glu 


Gly 


Val 


Ala 


Gin 


Val 


Val Ala 


Val 


Gly Ser Asn Val Thr 








110 








115 


120 


Gly 


Leu 


Lys 


Pro 


Gly 


Asp 


Trp 


Val He 


Pro 


Ala Asn Ala Gly Leu 










125 








130 


135 


Gly 


Thr 


Trp 


Arg 


Thr 


Glu 


Ala 


Val Phe 


Ser 


Glu Glu Ala Leu He 










140 








145 


150 


Gin 


Val 


Pro 


Ser 


Asp 


He 


Pro 


Leu Gin 


Ser 


Ala Ala Thr Leu Gly 










155 








160 


165 


Val 


Asn 


Pro 


Cys 


Thr 


Ala 


Tyr 


Arg Met 


Leu 


Met Asp Phe Glu Gin 








170 




175 


180 


Leu 


Gin 


Pro 


Gly 


Asp 


Ser 


Val 


He Gin 


Asn 


Ala Ser Asn Ser Gly 










185 








190 


195 


Val 


Gly 


Gin 


Ala 


Val 


He 


Gin 


He Ala 


Ala 


Ala Leu Gly Leu Arg 










200 








205 


210 


Thr 


He 


Asn 


Val 


Val 


Arg 


Asp 


Arg Pro 


Asp 


He Gin Lys Leu Ser 










215 








220 


225 


Asp 


Arg 


Leu 


Lys 


Ser 


Leu 


Gly 


Ala Glu 


His 


Val He Thr Glu Glu 








230 






235 


240 


Glu 


Leu 


Arg 


Arg 


Pro 


Glu 


Met 


Lys Asn 


Phe 


Phe Lys Asp Met Pro 










245 








250 


255 


Gin 


Pro 


Arg 


Leu 


Ala 


Leu 


Asn 


Cys Val 


Gly 


Gly Lys Ser Ser Thr 










260 








265 


270 


Glu 


Leu 


Leu 


Arg 


Gin 


Leu 


Ala 


Arg Gly 


Gly 


Thr Met Val Thr Tyr 










275 








280 


285 


Gly 


Gly 


Met 


Ala 


Lys 


Gin 


Pro 


Val Val 


Ala 


Ser Val Ser Leu Leu 






290 








295 


300 


He 


Phe 


Lys 


Asp 


Leu 


Lys 


Leu 


Arg Gly 


Phe 


Trp Leu Ser Gin Trp 










305 








310 


315 


Lys 


Lys 


Asp 


His 


Ser 


Pro 


Asp 


Gin Phe 


Lys 


Glu Leu He Leu Thr 










320 








325 


330 
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Leu Cys Asp Leu lie Arg Arg Gly Gin Leu Thr Ala Pro Ala Cys 

335 340 345 

Ser Gin Val Pro Leu Gin Asp Tyr Gin Ser Ala Leu Glu Ala Ser 

350 355 360 

Met Lys Pro Phe lie Ser Ser Lys Gin lie Leu Thr Met 
365 370 



<210> 13 

<211> 305 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 253683 0CD1 

<400> 13 



Met 


Ala 


Lys Phe 


Val 


He 


Ala 


Gly 


1 






5 








Ala 


Lys 


Thr Glu 


Leu 


Val 


Ala 


Asp 








20 








Asp 


Phe 


Arg He 


His 


Lys 


He 


Thr 








35 








Asp 


Trp 


Leu Lys 


Asp 


Val 


Cys 


Glu 








50 








Asn 


Ser 


Pro He 


He 


Trp 


Arg 


Glu 








65 








Gly 


Leu 


Leu Leu 


Gly 


Gly 


Tyr 


Asn 








80 








Leu 


Tyr 


Tyr Asp 


Val 


Thr 


Ser 


Ser 








95 








Val 


He 


Ala Gin 


Glu 


Asn 


Leu 


Gly 








110 








Glu 


Glu 


Glu Ala 


Leu 


Lys 


Thr 


Cys 








125 






He 


Thr 


Ser Ala 


Ser 


Ala 


Pro 


Ala 








140 








Leu 


Thr 


Ser Gly 


Glu 


Val 


Phe 


Gly 






155 






Thr 


Leu 


Phe Asp 


Asn 


Lys 


Gin 


Ala 








17 0 








Val 


Val 


Glu Thr 


Gin 


Asp 


Leu 


Ala 








185 








ser 


He 


Cys Thr 


Lys 


Val 


Glu 


Glu 








200 








He 


Val 


Val Leu 


Asp 


Asp 


Ser 


Thr 








215 








Glu 


Asp 


Cys Leu 


Arg 


Ser 


Arg 


Val 








230 








Tyr 


Leu 


He Glu 


Lys 


Asn 


Ala 


His 








245 








Gly 


Gly 


Arg Thr 


Phe 


Val 


Asn 


Leu 








260 








Tyr 


Ala 


Pro Arg 


He 


Ala 


His 


Asn 




275 








Val 


Glu 


Gly Glu 


Ala 


Lys 


Ala 


He 








290 








Ala 


Pro 


Ser Cys 


Glu 









305 



Arg 


Ala 


Asp 


Cys Pro 


Tyr Tyr 




10 






15 


Tyr 


Leu 


Gin Lys Asn 


Leu Pro 




25 






30 


Gin 


Arg 


Pro 


Glu Val 


Trp Glu 




40 






45 


Lys 


Asn 


Lys 


Trp ser 


His Lys 




55 






60 


Leu 


Leu 


Asp Arg Gly Gly Lys 




70 






75 


Glu 


Phe 


Leu 


Glu His 


Ala Gin 




85 






90 


Met 


Thr 


Thr 


Glu Leu 


Met Met 




100 






105 


Ala 


His 


He 


Glu Lys 


Glu Gin 




115 






120 


He 


Asn 


Pro 


Leu Gin 


Val Trp 




130 






135 


Cys 


Tyr 


Asn 


Leu He 


Pro He 




145 






150 


Met 


His 


Thr 


Glu He 


Ser He 




160 






165 


Glu 


Glu 


His 


Leu Lys 


Ser Leu 




175 






180 


Ser 


Pro 


Val 


Leu Arg 


Ser Val 




190 






195 


Ala 


Phe 


Arg Gin Ala His Val 




205 






210 


Asn Lys 


Glu 


Val Phe 


Thr Leu 




220 






225 


Pro 


Leu 


Cys 


Arg Leu Tyr Gly 




235 






240 


Glu 


Ser 


Val Arg Val 


He Val 




250 






255 


Lys 


Thr 


Val 


Leu Leu Met Arg 




265 






270 


He 


He 


Ala Val Ala Leu Gly 




280 






285 


Leu 


Ala 


Arg 


Lys Leu 


Lys Thr 




295 






300 



<210> 14 
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<211> 500 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2645179CD1 

<400> 14 



Met 


Glu 


Ala 


Ala 


Arg 


Pro 


Pro Pro Thr Ala Gly Lys Phe Val 


Val 


1 








5 




10 


15 


Val 


Gly 


Gly 


Gly 


He 


Ala 


Gly Val Thr Cys Ala Glu Gin Leu 


Ala 










20 




25 


30 


Thr 


His 


Phe 


Pro 


Ser 


Glu 


Asp He Leu Leu Val Thr Ala Ser 


Pro 










35 




40 


45 


Val 


He 


Lys 


Ala 


Val 


Thr 


Asn Phe Lys Gin He Ser Lys He 


Leu 








50 




55 


60 


Glu 


Glu 


Phe 


Asp 


Val 


Glu 


Glu Gin Ser Ser Thr Met Leu Gly 


Lys 










65 




70 


75 


Arg 


Phe 


Pro 


Asn 


He 


Lys 


Val He Glu Ser Gly Val Lys Gin 


Leu 










80 




85 


90 


Lys 


ser 


Glu 


Glu 


His 


cys 


He Val Thr Glu Asp Gly Asn Gin 


His 








95 




100 


105 


Val 


Tyr 


Lys 


Lys 


Leu 


Cys 


Leu Cys Ala Gly Ala Lys Pro Lys 


Leu 










110 




115 


120 


He 


Cys 


Glu 


Gly 


Asn 


Pro 


Tyr Val Leu Gly He Arg Asp Thr 


Asp 










125 




130 


135 


Ser 


Ala 


Gin 


Glu 


Phe 


Gin 


Lys Gin Leu Thr Lys Ala Lys Arg 


He 










140 




145 


150 


Met 


He 


He 


Gly 


Asn 


Gly 


Gly He Ala Leu Glu Leu Val Tyr 


Glu 










155 




160 


165 


He 


Glu 


Gly 


Cys 


Glu 


Val 


He Trp Ala He Lys Asp Lys Ala 


He 










170 




175 


180 


Gly 


Asn 


Thr 


Phe 


Phe 


Asp 


Ala Gly Ala Ala Glu Phe Leu Thr 


Ser 








185 


190 


195 


Lys 


Leu 


He 


Ala 


Glu 


Lys 


Ser Glu Ala Lys He Ala His Lys 


Arg 










200 




205 


210 


Thr 


Arg 


Tyr 


Thr 


Thr 


Glu 


Gly Arg Lys Lys Glu Ala Arg Ser 


Lys 










215 




220 


225 


Ser 


Lys 


Ala 


Asp 


Asn 


Val 


Gly Ser Ala Leu Gly Pro Asp Trp 


His 










230 




235 


240 


Glu 


Gly 


Leu 


Asn 


Leu 


Lys 


Gly Thr Lys Glu Phe Ser His Lys 


He 










245 




250 


255 


His 


Leu 


GlU 


Thr 


Met 


Cys 


Glu Val Lys Lys He Tyr Leu Gin 


Asp 










260 




265 


270 


Glu 


Phe 


Arg 


He 


Leu 


Lys 


Lys Lys Ser Phe Thr Phe Pro Arg 


Asp 










275 




280 


285 


His 


Lys 


Ser 


Val 


Thr 


Ala 


Asp Thr Glu Met Trp Pro Val Tyr 


Val 










290 




295 


300 


GlU 


Leu 


Thr 


Asn 


Glu 


Lys 


He Tyr Gly Cys Asp Phe He Val 


Ser 










305 




310 


315 


Ala 


Thr 


Gly 


val 


Thr 


Pro 


Asn Val Glu Pro Phe Leu His Gly 


Asn 










320 




325 


330 


Ser 


Phe 


Asp 


Leu 


Gly 


Glu 


Asp Gly Gly Leu Lys Val Asp Asp 


His 










335 




340 


345 


Met 


His 


Thr 


Ser 


Leu 


Pro 


Asp He Tyr Ala Ala Gly Asp He 


Cys 










350 




355 


360 


Thr 


Thr 


Ser 


Trp 


Gin 


Leu 


Ser Pro Val Trp Gin Gin Met Arg 


Leu 










365 




370 


375 


Trp 


Thr 


Gin 


Ala 


Arg 


Gin 


Met Gly Trp Tyr Ala Ala Lys Cys 


Met 










380 




385 


390 


Ala 


Ala 


Ala 


Ser 


Ser 


Gly 


Asp Ser He Asp Met Asp Phe Ser 


Phe 










395 




400 


405 


Glu 


Leu 


Phe 


Ala 


His 


Val 


Thr Lys Phe Phe Asn Tyr Lys Val 


Val 










410 




415 


420 


Leu 


Leu 


Gly 


Lys 


Tyr 


Asn 


Ala Gin Gly Leu Gly Ser Asp His 


Glu 










425 




430 


435 
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Leu 


Met Leu Arg 


Cys 


Thr 


Lys 






440 






Met 


Gin Asn Gly 


Arg 


Met 


Met 






455 






Asp 


Leu Glu Glu 


Thr 


Phe 


Glu 






470 






Leu 


Ser Ser Tyr 


Gly 


Glu Asp 






485 






Glu Asp Tyr Phe 


Asp 










500 







Gly Arg 


Glu 


Tyr 


He Lys Val 


Val 




445 






450 


Gly Ala 


Val 


Leu 


He Gly Glu 


Thr 




460 






465 


Asn Leu 


He 


Leu 


Asn Gin Met 


Asn 




475 






480 


Leu Leu 


Asp 


Pro 


Asn He Asp 


He 




490 






495 



<210> 15 

<211> 369 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2754425CD1 

<400> 15 



Met 


Val 


Trp 


Ala 


Pro 


Leu 


Gly 


Pro Pro 


Arg 


Thr Asp Cys Leu 


Thr 


1 








5 








10 




15 


Leu 


Leu 


His 


Thr 


Pro 


Ser 


Lys 


Asp Ser 


Pro 


Lys Met Ser Leu 


Glu 










20 








25 




30 


Trp 


Leu 


Val 


Ala 


Trp 


Ser 


Trp 


Ser Leu 


Asp 


Gly Leu Arg Asp 


Cys 










35 








40 




45 


He 


Ala 


Thr 


Gly 


He 


Gin 


Ser 


Val Arg 


Asp 


Cys Asp Thr Thr 


Ala 










50 








55 




60 


Val 


He 


Thr 


Val 


Ala 


Cys 


Leu 


Leu Val 


Leu 


Phe Val Trp Tyr 


Cys 










65 








70 




75 


Tyr 


His 


Val 


Gly 


Arg 


Glu 


Gin 


Pro Arg 


Pro 


Tyr Val Ser Val 


Asn 










80 








85 




90 


Ser 


Leu 


Met 


Gin 


Ala 


Ala 


Asp 


Ala Asn 


Gly 


Leu Gin Asn Gly 


Tyr 










95 








100 




105 


Val 


Tyr 


Cys 


Gin 


Ser 


Pro 


Glu 


Cys Val 


Arg 


Cys Thr His Asn 


Glu 










110 








115 




120 


Gly 


Leu 


Asn 


Gin 


Lys 


Leu 


Tyr 


His Asn 


Leu 


Gin Glu Tyr Ala 


Lys 










125 








130 




135 


Arg 


Tyr 


Ser 


Trp 


Ser 


Gly 


Met 


Gly Arg 


He 


His Lys Gly He 


Arg 










140 








145 




150 


Glu 


Gin 


Gly 


Arg 


Tyr 


Leu 


Asn 


Ser Arg 


Pro 


Ser He Gin Lys 


Pro 










155 








160 




165 


GlU 


Val 


Phe 


Phe 


Leu 


Pro 


Asp 


Leu Pro 


Thr 


Thr Pro Tyr Phe 


Ser 










170 








175 




180 


Arg 


Asp 


Ala 


Gin 


Lys 


His 


Asp 


Val Glu 


Val 


Leu Glu Arg Asn 


Phe 










185 








190 




195 


Gin 


Thr 


He 


Leu 


Cys 


Glu 


Phe 


Glu Thr 


Leu 


Tyr Lys Ala Phe 


Ser 










200 








205 




210 


Asn 


Cys 


Ser 


Leu 


Pro 


Gin 


Gly 


Trp Lys 


Met 


Asn Ser Thr Pro 


Ser 










215 








220 




225 


Gly 


Glu 


Trp 


Phe 


Thr 


Phe 


Tyr 


Leu Val 


Asn 


Gin Gly Val Cys 


Val 










230 








235 




240 


Pro 


Arg 


Asn 


Cys 


Arg 


Lys 


Cys 


Pro Arg 


Thr 


Tyr Arg Leu Leu 


Gly 










245 








250 




255 


Ser 


Leu 


Arg 


Thr 


Cys 


He 


Gly 


Asn Asn 


Val 


Phe Gly Asn Ala 


Cys 










260 








265 




270 


He 


Ser 


Val 


Leu 


Ser 


Pro 


Gly 


Thr Val 


He 


Thr Glu His Tyr 


Gly 










275 








280 




285 


Pro 


Thr 


Asn 


He 


Arg 


He 


Arg 


Cys His 


Leu 


Gly Leu Lys Thr 


Pro 










290 








295 




300 


Asn 


Gly 


Cys 


Glu 


Leu 


Val 


Val 


Gly Gly 


Glu 


Pro Gin Cys Trp 


Ala 










305 








310 




315 


Glu 


Gly 


Arg 


Cys 


Leu 


Leu 


Phe 


Asp Asp 


Ser 


Phe Leu His Ala 


Ala 
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320 325 330 

Phe His Glu Gly Ser Ala Glu Asp Gly Pro Arg Val Val Phe Met 

335 340 345 

Val Asp Leu Trp His Pro Asn Val Ala Ala Ala Glu Arg Gin Ala 

350 355 360 

Leu Asp Phe lie Phe Ala Pro Gly Arg 

365 



<210> 16 

<211> 145 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> roisc_feature 

<223> Incyte ID No: 2821526CD1 

<400> 16 



Met 


Glu 


Leu 


Val Gin 


Val 


Leu 


Lys Arg Gly 


Leu Gin Gin He 


Thr 


1 






5 






10 




15 


Gly 


His 


Gly 


Gly Leu 


Arg 


Gly 


Tyr Leu Arg 


Val Phe Phe Arg 


Thr 








20 






25 




30 


Asn 


Asp 


Ala 


Lys Val 


Gly 


Thr 


Leu Val Gly 


Glu Asp Lys Tyr 


Gly 








35 






40 




45 


Asn 


Lys 


Tyr 


Tyr Glu 


Asp 


Asn 


Lys Gin Phe 


Phe Gly Arg His 


Arg 








50 






55 




60 


Trp 


Val 


Val 


Tyr Thr 


Thr 


Glu 


Met Asn Gly 


Lys Asn Thr Phe 


Trp 








65 






70 




75 


Asp 


Val 


Asp 


Gly Ser 


Met 


Val 


Pro Pro Glu 


Trp His Arg Trp 


Leu 








80 






85 




90 


His 


Ser 


Met 


Thr Asp 


Asp 


Pro 


Pro Thr Thr 


Lys Pro Leu Thr 


Ala 








95 






100 




105 


Arg 


Lys 


Phe 


He Trp 


Thr 


Asn 


His Lys Phe 


Asn Val Thr Gly 


Thr 








110 






115 




120 


Pro 


Glu 


Gin 


Tyr Val 


Pro 


Tyr 


Ser Thr Thr 


Arg Lys Lys He 


Gin 








125 






130 




135 


Glu 


Trp 


He 


Pro Pro 


Ser 


Thr 


Pro Tyr Lys 












140 






145 







<210> 17 
<211> 255 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2876494CD1 

<400> 17 



Met 


Lys 


Val 


Leu 


Ala 


Thr 


Ser 


Phe Val Leu 


Gly Ser Leu Gly 


Leu 


1 








5 






10 


15 


Ala 


Phe 


Tyr 


Leu 


Pro 


Leu 


Val 


Val Thr Thr 


Pro Lys Thr Leu 


Ala 










20 






25 


30 


He 


Pro 


Glu 


Lys 


Leu 


Gin 


Glu 


Ala Val Gly 


Lys Val He He 


Asn 










35 






40 


45 


Ala 


Thr 


Thr 


Cys 


Thr 


Val 


Thr 


Cys Gly Leu 


Gly Tyr Lys Glu 


Glu 










50 






55 


60 


Thr 


Val 


Cys 


Glu 


Val 


Gly 


Pro 


Asp Gly Val 


Arg Arg Lys Cys 


Gin 










65 






70 




75 


Thr 


Gin 


Arg 


Leu 


Glu 


Cys 


Leu 


Thr Asn Trp 


He Cys Gly Met 


Leu 










80 






85 




90 


His 


Phe 


Thr 


He 


Leu 


He 


Gly 


Lys Glu Phe 


Glu Leu Ser Cys 


Leu 
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95 



Ser 


Ser 


Asp lie 


Leu 


Giu Phe Gly 








110 




Trp 


Arg 


Leu Ala 


Arg 


Gly Val He 








125 




Lys 


Pro 


Phe Gin 


Ala 


Asn Ser His 








140 




Gin 


GlU 


Tyr Asp 


Ser 


Gly Thr Tyr 








155 




Lys 


Asn 


Leu Arg 


Leu 


Val Lys Arg 








170 




Leu 


Pro 


Pro Asn 


Leu 


Val Asn Leu 








1 QC 

La D 




Glu 


Asp 


Gin Lys 


Leu 


He Asp Glu 








200 




ser 


Tyr 


Ser Lys 


Pro 


His His Pro 








215 




Ser 


Ala 


Leu Gly 


He 


Gly He Ala 








230 




Leu 


Val 


Arg lie 


Val 


Leu Cys Ala 



245 



<210> 18 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 3403225CD1 

<400> 18 



Met 


Leu 


Val 


Thr 


Leu 


Gly 


Leu 


Leu 


1 








5 






Tyr 


Met 


Val 


Ala 


Pro 


Ser 


He 


Arg 










20 








Cys 


Arg 


Thr 


Asn 


Val 


Gin 


Leu 


Pro 










35 








Gly 


Ala 


Asn 


Thr 


Gly 


He 


Gly 


Lys 










50 








Ser 


Arg 


Gly 


Ala 


Arg 


Val 


Tyr 


He 










65 








Gly 


Glu 


Ser 


Ala 


Ala 


Ser 


Glu 


He 










80 








Gin 


Val 


Leu 


Val 


Arg 


Lys 


Leu 


Asp 










95 








Arg 


Ala 


Phe 


Ala 


Glu 


Gly 


Phe 


Leu 










110 






He 


Leu 


He 


Asn 


Asn 


Ala 


Gly 


Val 










125 






Thr 


Ala 


Asp 


Gly 


Phe 


Glu 


Thr 


His 










140 








Thr 


Gly 


Val 


Thr 


Thr 


Tyr 


Ala 


Val 










155 








Glu 


Leu 


Val 


Arg 


His 


Ser 


Ser 


Leu 










170 








Phe 


Ser 


Pro 


Phe 


Val 


Lys 


Thr 


Ala 










185 








Leu 


His 


Cys 


Ala 


Leu 


Ala 


Glu 


Gly 










200 








Tyr 


Phe 


Ser 


Asp 


Cys 


Lys 


Arg 


Thr 










215 








Asn 


Asn 


Lys 


Thr 


Ala 


Glu 


Arg 


Leu 



230 



100 




105 


Gin Glu Ala Phe 


Arg 


Phe Thr 


115 




120 


Ser Thr Asp Asp 


Glu 


Val Phe 


130 




135 


Phe Val Lys Phe 


Lys 


Tyr Ala 


145 




150 


Arg Cys Asp Val 


Gin 


Leu Val 


160 




165 


Leu Tyr Phe Gly 


Leu 


Arg Val 


175 




180 


Asn Phe His Gin 


Ser 


Leu Thr 


190 




195 


Gly Leu Glu Val 


Asn 


Leu Asp 


205 




210 


Lys Trp Lys Lys 


Lys 


Val Ala 


220 




225 


He Gly Val Val 


Gly 


Gly Val 


235 




240 


Leu Arg Gly Gly 


Leu 


Gin Gin 


250 




255 



Thr Ser Phe Phe 


Ser 


Phe 


Leu 


10 






15 


Lys Phe Phe Ala 


Gly 


Gly Val 


25 






30 


Gly Lys Val Val 


Val 


He 


Thr 


40 






45 


Glu Thr Ala Arg 


Glu 


Leu 


Ala 


55 






60 


Ala Cys Arg Asp 


Val 


Leu Lys 


70 






75 


Arg Val Asp Thr 


Lys 


Asn 


Ser 


85 






90 


Leu Ser Asp Thr 


Lys 


Ser 


He 


100 






105 


Ala Glu Glu Lys 


Gin 


Leu 


His 


115 






120 


Met Met Cys Pro 


Tyr 


Ser 


Lys 


130 






135 


Leu Gly Val Asn 


His 


Leu Gly 


145 






150 


His Pro Gly Val 


Val 


Arg 


Ser 


160 






165 


Leu Cys Leu Leu 


Trp 


Arg 


Leu 


175 






180 


Arg Glu Gly Ala 


Gin 


Thr 


Ser 


190 






195 


Leu Glu Pro Leu 


Ser 


Gly Lys 


205 






210 


Trp Val Ser Pro 


Arg 


Ala 


Arg 


220 






225 


Trp Asn Val Ser 


Cys 


Glu 


Leu 


235 






240 
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Leu Gly lie Arg Trp Glu 
245 



<210> 19 
<211> 467 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 4163943CD1 

<400> 19 



Met 


Ala 


Ala 


Ala 


Ala 


Arg 


Ala 


Arg Val Ala Tyr Leu 


Leu Arg Gin 


1 








5 






10 


15 


Leu 


Gin 


Arg 


Ala 


Ala 


Cys 


Gin 


Cys Pro Thr His Ser 


His Thr Tyr 










20 






25 


30 


Ser 


Gin 


Ala 


Pro 


Gly 


Leu 


Ser 


Pro Ser Gly Lys Thr 


Thr Asp Tyr 










35 






40 


45 


Ala 


Phe 


Glu 


Met 


Ala 


Val 


Ser 


Asn He Arg Tyr Gly 


Ala Ala Val 










50 






55 


60 


Thr 


Lys 


Glu 


Val 


Gly 


Met 


Asp 


Leu Lys Asn Met Gly 


Ala Lys Asn 










65 






70 


75 


Val 


Cys 


Leu 


Met 


Thr 


Asp 


Lys 


Asn Leu Ser Lys Leu 


Pro Pro Val 








80 






85 


90 


Gin 


Val 


Ala 


Met 


Asp 


Ser 


Leu 


Val Lys Asn Gly He 


Pro Phe Thr 










95 






100 


105 


Val 


Tyr 


Asp 


Asn 


Val 


Arg 


Val 


Glu Pro Thr Asp Ser 


Ser Phe Met 










110 






115 


120 


Glu 


Ala 


He 


Glu 


Phe 


Ala 


Gin 


Lys Gly Ala Phe Asp 


Ala Tyr Val 










125 






130 


135 


Ala 


Val 


Gly 


Gly 


Gly 


Ser 


Thr 


Met Asp Thr Cys Lys 


Ala Ala Asn 










140 






145 


150 


Leu 


Tyr 


Ala 


Ser 


Ser 


Pro 


His 


Ser Asp Phe Leu Asp 


Tyr Val Ser 








155 






160 


165 


Ala 


Pro 


He 


Gly 


Lys 


Gly 


Lys 


Pro Val Ser Val Pro 


Leu Lys Pro 










17 0 






175 


180 


Leu 


He 


Ala 


Val 


Pro 


Thr 


Thr 


Ser Gly Thr Gly Ser 


Glu Thr Thr 










185 






190 


195 


Gly 


Val 


Ala 


He 


Phe 


Asp 


Tyr 


Glu His Leu Lys Val 


Lys He Gly 










200 






205 


210 


lie 


Thr 


Ser 


Arg 


Ala 


He 


Lys 


Pro Thr Leu Gly Leu 


He Asp Pro 










215 






220 


225 


Leu 


His 


Thr 


Leu 


His 


Met 


Pro 


Ala Arg Val Val Ala 


Asn Ser Gly 










230 






235 


240 


Phe 


Asp 


Val 


Leu 


Cys 


His 


Ala 


Leu Glu Ser Tyr Thr 


Thr Leu Pro 








245 






250 


255 


Tyr 


His 


Leu 


Arg 


Ser 


Pro 


Cys 


Pro Ser Asn Pro He 


Thr Arg Pro 






260 




265 


270 


Ala 


Tyr 


Gin 


Gly 


Ser 


Asn 


Pro 


He Ser Asp He Trp 


Ala He His 






275 






280 


285 


Ala 


Leu 


Arg 


He 


Val 


Ala 


Lys 


Tyr Leu Lys Arg Ala 


Val Arg Asn 










290 






295 


300 


Pro 


Asp 


Asp 


Leu 


Glu 


Ala 


Arg 


Ser His Met His Leu 


Ala Ser Ala 










305 






310 


315 


Phe 


Ala 


Gly 


He 


Gly 


Phe 


Gly 


Asn Ala Gly Val His 


Leu Cys His 










320 






325 


330 


Gly 


Met 


Ser 


Tyr 


Pro 


He 


Ser 


Gly Leu Val Lys Met 


Tyr Lys Ala 










335 






340 


345 


Lys 


Asp 


Tyr 


Asn 


Val 


Asp 


His 


Pro Leu Val Pro His 


Gly Leu Ser 










350 






355 


360 


Val 


Val 


Leu 


Thr 


Ser 


Pro 


Ala 


Val Phe Thr Phe Thr 


Ala Gin Met 










365 






370 


375 


Phe 


Pro 


Glu 


Arg 


His 


Leu 


Glu 


Met Ala Glu He Leu 


Gly Ala Asp 
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380 




385 


390 


Thr 


Arg 


Thr Ala 


Arg 


He 


Gin Asp Ala Gly Leu Val Leu 


Ala Asp 








395 




400 


405 


Thr 


Leu 


Arg Lys 


Phe 


Leu 


Phe Asp Leu Asp Val Asp Asp 


Gly Leu 






410 




415 


420 


Ala 


Ala 


Val Gly 


Tyr 


Ser 


Lys Ala Asp He Pro Ala Leu 


Val Lys 








425 




430 


435 


Gly 


Thr 


Leu Pro 


Gin 


Glu 


Arg Val Thr Lys Leu Ala Pro 


Arg Pro 






440 




445 


450 


Gin 


Ser 


Glu Glu 


Asp 


Leu 


Ala Ala Leu Phe Glu Ala Ser 


Met Lys 








455 




460 


465 


Leu 


Tyr 













<210> 20 

<211> 317 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 4293484CD1 

<400> 20 



Met 


Ala 


Asp Ser 


Ala 


Gin 


Ala 


Gin Lys Leu Val Tyr Leu 


Val Thr 


1 






5 






10 


15 


Gly 


Gly 


Cys Gly 


Phe 


Leu 


Gly 


Glu His Val Val Arg Met 


Leu Leu 








20 






25 


30 


Gin 


Arg 


Glu Pro 


Arg 


Leu 


Gly 


Glu Leu Arg Val Phe Asp 


Gin His 








35 






40 


45 


Leu 


Gly 


Pro Trp 


Leu 


Glu 


Glu 


Leu Lys Thr Gly Thr Arg 


Asn Val 








50 






55 


60 


He 


Glu 


Ala Cys 


Val 


Gin 


Thr 


Gly Thr Arg Phe Leu Val 


Tyr Thr 








65 






70 


75 


Ser 


Ser 


Met Glu 


Val 


Val 


Gly 


Pro Asn Thr Lys Gly His 


Pro Phe 








80 






85 


90 


Tyr 


Arg 


Gly Asn 


Glu 


Asp 


Thr 


Pro Tyr Glu Ala Val His 


Arg His 








95 






100 


105 


Pro 


Tyr 


Pro Cys 


Ser 


Lys 


Ala 


Leu Ala Glu Trp Leu Val 


Leu Glu 




110 






115 


120 


Ala 


Asn 


Gly Arg 


Lys 


Val 


Arg 


Gly Gly Leu Pro Leu Val 


Thr Cys 








125 






13 0 


135 


Ala 


Leu 


Arg Pro 


Thr 


Gly 


He 


Tyr Gly Glu Gly His Gin 


He Met 








140 






145 


150 


Arg 


Asp 


Phe Tyr 


Arg 


Gin 


Gly 


Leu Arg Leu Gly Gly Trp 


Leu Phe 








155 






160 


165 


Arg 


Ala 


He Pro 


Ala 


Ser 


Val 


Glu His Gly Arg Val Tyr 


Val Gly 








170 






175 


180 


Asn 


Val 


Ala Trp 


Met 


His 


Val 


Leu Ala Ala Arg Glu Leu 


Glu Gin 






185 






190 


195 


Arg 


Ala 


Thr Leu 


Met 


Gly 


Gly 


Gin Val Tyr Phe Cys Tyr 


Asp Gly 








200 






205 


210 


Ser 


Pro 


Tyr Arg 


Ser 


Tyr 


Glu 


Asp Phe Asn Met Glu Phe 


Leu Gly 






215 






220 


225 


Pro 


cys 


Gly Leu 


Arg 


Leu 


Val 


Gly Ala Arg Pro Leu Leu 


Pro Tyr 






230 






235 


240 


Trp 


Leu 


Leu Val 


Phe 


Leu 


Ala 


Ala Leu Asn Ala Leu Leu 


Gin Trp 






245 






250 


255 


Leu 


Leu 


Arg. Pro 


Leu 


Val 


Leu 


Tyr Ala Pro Leu Leu Asn 


Pro Tyr 








260 






265 


270 


Thr 


Leu 


Ala Val 


Ala 


Asn 


Thr 


Thr Phe Thr Val ser Thr 


Asp Lys 








275 






280 


285 


Ala 


Gin 


Arg His 


Phe 


Gly 


Tyr 


Glu Pro Leu Phe Ser Trp 


Glu Asp 








290 






295 


300 


Ser 


Arg 


Thr Arg 


Thr 


He 


Leu 


Trp Val Gin Ala Ala Thr 


Gly Ser 
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Ala Gin 



<210> 21 

<211> 181 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4440080CD1 



<400> 21 
Met Phe Ser 
1 

lie Asn Asn 


Ser 


Thr 


Ser 


Leu 


Ser 


He 


Asn 


Val 


Asp 


His 


Arg 


Val 


Lys 


Tyr 


Ala 


Arg 


Glu 


Ala 


Val 


Val 


Glu 


Glu 


Lys 


Ala 


Glu 


Asp 


val 


His 


He 


Gin 


Thr 







Ala 


He 


Arg 




5 




Ala 


Gly 


Leu 




20 




Gly 


Trp 


Lys 




35 




Cys 


Thr 


Arg 




50 




Asp 


Gly 


His 




65 




Leu 


Pro 


Leu 




80 




Val 


Thr 


Ala 




95 




Gin 


Thr 


His 




110 




Thr 


Gin 


Phe 




125 




Ala 


Ala 


Thr 




140 




Ala 


GlU 


Ala 




155 




He 


Gly 


Asp 




170 





Ser Gin His 
Ala Arg Pro 
Asp Met Phe 
Glu Ala Tyr 
He He Asn 
Ser Val Thr 
Leu Thr Glu 
He Arg Ala 
Ala Phe Lys 
Tyr Glu Gin 
Val He Tyr 
He Gin Met 



Ser 


Gly Val 


10 






Asp 


Thr 


Leu 


25 






Asn 


Val 


Asn 


40 






Gin 


Ser 


Met 


55 






He 


Asn 


Ser 


70 






His 


Phe 


Tyr 


85 






Gly 


Leu Arg 


100 






Thr 


Cys 


He 


115 






Leu 


His 


Asp 


130 






Met 


Lys 


Cys 


145 






Val 


Leu 


Ser 


160 






Arg 


Pro 


Thr 


175 







Asp 


He 


Cys 






15 


Leu 


Ser 


Gly 






30 


Val 


Leu 


Ala 






45 


Lys 


Glu 


Arg 






60 


Met 


Ser 


Gly 






75 


Ser 


Ala 


Thr 






90 


Gin 


Glu 


Leu 






105 


Ser 


Pro 


Gly 






120 


Lys 


Asp 


Pro 






135 


Leu 


Lys 


Pro 






150 


Thr 


Pro 


Ala 






165 


Glu 


Gin 


Val 






180 



<210> 22 

<211> 360 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5495687CD1 

<400> 22 



Met Val Pro 


Ala 


Ala 


Gly 


Arg 


Arg Pro Pro 


Arg Val Met Arg 


Leu 


1 




5 






10 




15 


Leu Gly Trp 


Trp 


Gin 


Val 


Leu 


Leu Trp Val 


Leu Gly Leu Pro 


Val 






20 






25 




30 


Arg Gly Val 


Glu 


Val 


Ala 


Glu 


Glu Ser Gly 


Arg Leu Trp ser 


Glu 






35 






40 




45 


Glu Gin Pro 


Ala 


His 


Pro 


Leu 


Gin Val Gly 


Ala Val Tyr Leu 


Gly 






50 






55 


60 


Glu Glu Glu 


Leu 


Leu 


His 


Asp 


Pro Met Gly 


Gin Asp Arg Ala 


Ala 






65 






70 




75 


Glu Glu Ala 


Asn 


Ala 


val 


Leu 


Gly Leu Asp 


Thr Gin Gly Asp 


His 






80 






85 




90 
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Met 


Val 


Met 


Leu 


Ser 


Val 


He 


Pro Gly Glu Ala Glu Asp Lys Val 










95 








100 


105 


Ser 


Ser 


Glu 


Pro 


Ser Gly Val 


Thr Cys Gly Ala Gly Gly Ala Glu 










110 








115 


120 


Asp 


Ser 


Arg 


Cys 


Asn 


Val 


Arg 


Glu 


Ser Leu 


Phe Ser Leu Asp Gly 






125 








130 


135 


Ala 


Gly 


Ala 


His 


Phe 


Pro 


Asp 


Arg Glu Glu Glu Tyr Tyr Thr Glu 








140 








145 


150. 


Pro 


Glu 


Val 


Ala 


Glu 


Ser Asp 


Ala 


Ala Pro 


Thr Glu Asp Ser Asn 










155 








160 


165 


Asn 


Thr 


Glu 


Ser 


Leu 


Lys 


Ser 


Pro 


Lys Val 


Asn Cys Glu Glu Arg 










170 








175 


180 


Asn 


lie 


Thr 


Gly 


Leu 


Glu 


Asn 


Phe 


Thr Leu 


Lys He Leu Asn Met 








185 








190 


195 


Ser 


Gin 


Asp 


Leu 


Met 


Asp 


Phe 


Leu 


Asn Pro 


Asn Gly Ser Asp Cys 










200 








205 


210 


Thr 


Leu 


Val 


Leu 


Phe 


Tyr 


Thr 


Pro 


Trp Cys 


Arg Phe Ser Ala Ser 










215 






220 


225 


Leu 


Ala 


Pro 


His 


Phe 


Asn 


Ser 


Leu 


Pro Arg Ala Phe Pro Ala Leu 










230 








235 


240 


His 


Phe 


Leu 


Ala 


Leu Asp Ala 


Ser 


Gin His 


Ser Ser Leu Ser Thr 










245 








250 


255 


Arg 


Phe 


Gly 


Thr 


Val 


Ala 


Val 


Pro 


Asn He 


Leu Leu Phe Gin Gly 






260 








265 


270 


Ala 


Lys 


Pro 


Met 


Ala 


Arg 


Phe 


Asn 


His Thr 


Asp Arg Thr Leu Glu 










275 








280 


285 


Thr 


Leu 


Lys 


He 


Phe 


He 


Phe 


Asn 


Gin Thr 


Gly He Glu Ala Lys 








290 








295 


300 


Lys 


Asn 


Val 


Val 


Val 


Thr 


Gin 


Ala Asp Gin 


He Gly Pro Leu Pro 








305 








310 


315 


Ser 


Thr 


Leu 


He 


Lys 


Ser 


Val 


Asp Trp Leu 


Leu Val Phe Ser Leu 










320 








325 


330 


Phe 


Phe 


Leu 


He 


Ser 


Phe 


He 


Met 


Tyr Ala 


Thr He Arg Thr Glu 










335 








340 


345 


Ser 


He 


Arg 


Trp 


Leu 


He 


Pro 


Gly Gin Glu Gin Glu His Val Glu 






350 








355 


360 



<210> 23 
<211> 476 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5527735CD1 

<400> 23 



Met 


Ala 


Ser 


Pro 


Phe 


ser 


Gly 


Ala Leu Gin 


Leu Thr Asp Leu Asp 


1 








5 




10 


15 


Asp 


Phe 


He 


Gly 


Pro 


Ser 


Gin 


Glu Cys He 


Lys Pro Val Lys Val 










20 






25 


30 


Glu 


Lys 


Arg 


Ala 


Gly 


Ser 


Gly 


Val Ala Lys 


He Arg He Glu Asp 










35 






40 


45 


Asp 


Gly 


Ser 


Tyr 


Phe 


Gin 


He 


Asn Gin Asp 


Gly Gly Thr Arg Arg 






50 






55 


60 


Leu 


Glu 


Lys 


Ala 


Lys 


Val 


Ser 


Leu Asn Asp 


Cys Leu Ala Cys Ser 








65 






70 


75 


Gly 


Cys 


He 


Thr 


Ser 


Ala 


Glu 


Thr Val Leu 


He Thr Gin Gin Ser 






80 






85 


90 


His 


Glu 


Glu 


Leu 


Lys 


Lys 


Val 


Leu Asp Ala Asn Lys Met Ala Ala 










95 




100 


105 


Pro 


Ser 


Gin 


Gin 


Arg 


Leu 


Val 


Val Val ser 


Val Ser Pro Gin Ser 










110 






115 


120 


Arg 


Ala 


Ser 


Leu 


Ala 


Ala 


Arg 


Phe Gin Leu 


Asn Pro Thr Asp Thr 
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125 



Ala 


Arg 


Lys Leu 


Thr 
140 


Ser 


Phe 


Val 


Phe 


Asp Thr 


Ala 
155 


Phe 


Ser 


Gin 


Arg 


Glu Phe 


Val 
170 


Arg 


Arg 


Gin 


Ala 


Leu Pro 


Leu 


Leu 


Ala 








185 




Tyr 


Ala 


Glu Lys 


Thr 
200 


His 


Gly 


Thr 


Ala 


Arg Ser 


Pro 
215 


Gin 


Gin 


Phe 


Phe 


Ala Gin 


Gin 
230 


Gin 


His 


Val 


Thr 


Val Met 


Pro 
245 


Cys 


Tyr 


Pro 


Asp 


Phe Phe 


Asn 


Gin 


Glu 






260 






Val 


Leu 


Thr Thr 


Gly 
275 


Glu 


Val 


Val 


Ser 


Leu Pro 


Asp 
290 


Leu 


Glu 


Ser 


Gly 


Ala Ser 


Ala 
305 


GlU 


Glu 


Ser 


Gly 


Gly Tyr 


Leu 
320 


GlU 


His 


Leu 


Phe 


Gly He 


His 


Val 


Ala 






335 






Asn 


Lys 


Asp Phe 


Gin 
350 


Glu 


Val 


Leu 


Leu 


His Phe 


Ala 
365 


Met 


Ala 


Leu 


Val 


Gin Arg 


Leu 
380 


Lys 


Arg 


Glu 


Val 


Met Ala 


Cys 
395 


Pro 


Ser 


Leu 


Gin 


Ala Pro 


Asp 
410 


Arg 


Pro 


Glu 


Arg 


Leu Tyr 


Gly 
425 


Met 


Val 


Pro 


Gly 


Val Gin 


Glu 
440 


Leu 


Tyr 


Ser 


Glu 


Cys Ala 


Gly 
455 


Arg 


Leu 


Glu 


Lys 


Ala Ser 


Thr 


Gly 


Leu 



47 0 







130 




135 


Phe 


Lys 


Lys 


He Gly Val His 


Phe 






145 




150 


Arg 


His 


Phe 


Ser Leu Leu Glu 


Ser 




160 




165 


Phe 


Arg 


Gly 


Gin Ala Asp Cys 


Arg 






175 




180 


Ser 


Ala 


Cys 


Pro Gly Trp He 


Cys 






190 




195 


Ser 


Phe 


He 


Leu Pro His He 


Ser 






205 




210 


Val 


Met 


Gly 


Ser Leu Val Lys 


Asp 






220 




225 


Leu 


Thr 


Pro 


Asp Lys He Tyr 


His 






235 




240 


Asp 


Lys 


Lys 


Leu Glu Ala Ser 


Arg 






250 




255 


His 


Gin 


Thr 


Arg Asp Val Asp 


Cys 






265 




270 


Phe 


Arg 


Leu 


Leu Glu Glu Glu 


Gly 






280 




285 


Pro 


Ala 


Pro 


Leu Asp Ser Leu 


Cys 






295 




300 


Pro 


Thr 


Ser 


His Arg Gly Gly 


Gly 






310 




315 


Val 


Phe 


Arg 


His Ala Ala Arg 


Glu 






325 




330 


Glu 


Val 


Thr 


Tyr Lys Pro Leu 


Arg 






340 




345 


Thr 


Leu 


Glu 


Lys Glu Gly Gin 


Val 






355 




360 


Tyr 


Gly 


Phe 


Arg Asn He Gin 


Asn 






370 




375 


Gly 


Arg 


Cys 


Pro Tyr His Tyr 


Val 






385 




390 


Gly 


Cys 


Leu 


Asn Gly Gly Gly 


Gin 






400 




405 


Ser 


Arg Glu 


Leu Leu Gin His 


Val 






415 




420 


Arg 


Ala 


Glu 


Ala Pro Glu Asp 


Ala 






430 


435 


Thr 


His 


Trp 


Leu Gin Gly Thr 


Asp 






445 




450 


Leu 


His 


Thr 


Gin Tyr His Ala 


Val 






460 




465 


Gly 


He 


Arg 


Trp 








475 







<210> 24 

<211> 621 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5540437CD1 

<400> 24 

Met Ser Gly Cys Gly Leu Phe Leu Arg Thr Thr Ala Ala Ala Arg 
15 10 15 

Ala Cys Arg Gly Leu Val Val Ser Thr Ala Asn Arg Arg Leu Leu 

20 25 30 

Arg Thr Ser Pro Pro Val Arg Ala Phe Ala Lys Glu Leu Phe Leu 

35 40 45 
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Gly 


Lys 


He 


Lys 


Lys 


Lys 


Glu 


Val 


Phe Pro 


Phe Pro Glu Val Ser 










50 






55 


60 


Gin 


Asp 


Glu 


Leu 


Asn 


Glu 


He 


Asn 


Gin Phe 


Leu Gly Pro Val Glu 








65 








70 


75 


Lys 


Phe 


Phe 


Thr 


Glu 


Glu 


Val 


Asp 


Ser Arg 


Lys He Asp Gin Glu 










80 








85 


90 


Gly 


Lys 


He 


Pro 


Asp 


Glu 


Thr 


Leu 


Glu Lys 


Leu Lys Ser Leu Gly 










95 








100 


105 


Leu 


Phe 


Gly 


Leu 


Gin 


Val 


Pro 


Glu 


Glu Tyr 


Gly Gly Leu Gly Phe 










110 








115 


120 


Ser 


Asn 


Thr 


Met 


Tyr 


Ser 


Arg 


Leu 


Gly Glu 


He He Ser Met Asp 










125 






130 


135 


Gly 


Ser 


He 


Thr 


Val 


Thr 


Leu 


Ala 


Ala His 


Gin Ala He Gly Leu 








140 








145 


150 


Lys 


Gly 


He 


He 


Leu 


Ala 


Gly 


Thr 


Glu Glu 


Gin Lys Ala Lys Tyr 










155 








160 


165 


Leu 


Pro 


Lys 


Leu 


Ala 


Ser 


Gly 


Glu 


His He 


Ala Ala Phe Cys Leu 










170 








175 


180 


Thr 


Glu 


Pro 


Ala 


Ser 


Gly 


Ser 


Asp 


Ala Ala 


Ser He Arg Ser Arg 










185 








190 


195 


Ala 


Thr 


Leu 


Ser 


Glu 


Asp 


Lys 


Lys 


His Tyr 


He Leu Asn Gly Ser 










200 








205 


210 


Lys 


Val 


Trp 


He 


Thr 


Asn 


Gly 


Gly 


Leu Ala 


Asn He Phe Thr Val 










215 








220 


225 


Phe 


Ala 


Lys 


Thr 


Glu 


Val 


Val 


Asp 


Ser Asp 


Gly Ser Val Lys Asp 










230 








235 


240 


Lys 


He 


Thr 


Ala 


Phe 


He 


Val 


Glu 


Arg Asp 


Phe Gly Gly Val Thr 








245 








250 


255 


Asn 


Gly 


Lys 


Pro 


Glu 


Asp 


Lys 


Leu 


Gly He 


Arg Gly Ser Asn Thr 










260 








265 


270 


Cys 


Glu 


Val 


His 


Phe 


Glu 


Asn 


Thr 


Lys He 


Pro Val Glu Asn He 








275 








280 


285 


Leu 


Gly 


Glu 


Val 


Gly 


Asp 


Gly 


Phe 


Lys Val 


Ala Met Asn He Leu 










290 








295 


300 


Asn 


Ser 


Gly 


Arg 


Phe 


Ser 


Met 


Gly 


Ser Val 


Val Ala Gly Leu Leu 










305 








310 


315 


Lys 


Arg 


Leu 


He 


Glu 


Met 


Thr 


Ala 


Glu Tyr 


Ala Cys Thr Arg Lys 










320 








325 


330 


Gin 


Phe 


Asn 


Lys 


Arg 


Leu 


Ser 


Glu 


Phe Gly 


Leu He Gin Glu Lys 










335 








340 


345 


Phe 


Ala 


Leu 


Met 


Ala 


Gin 


Lys 


Ala 


Tyr Val 


Met Glu Ser Met Thr 










350 






355 


360 


Tyr 


Leu 


Thr 


Ala 


Gly 


Met 


Leu 


Asp 


Gin Pro 


Gly Phe Pro Asp Cys 








365 








370 


375 


Ser 


He 


Glu 


Ala 


Ala 


Met 


Val 


Lys 


Val Phe 


Ser Ser Glu Ala Ala 










380 






385 


390 


Trp 


Gin 


Cys 


Val 


Ser 


Glu 


Ala 


Leu 


Gin He 


Leu Gly Gly Leu Gly 








395 








400 


405 


Tyr 


Thr 


Arg 


Asp 


Tyr 


Pro 


Tyr 


Glu 


Arg lie 


Leu Arg Asp Thr Arg 










410 








415 


420 


He 


Leu 


Leu 


He 


Phe 


Glu 


Gly 


Thr 


Asn Glu 


He Leu Arg Met Tyr 










425 






430 


435 


He 


Ala 


Leu 


Thr 


Gly 


Leu 


Gin 


His 


Ala Gly 


Arg He Leu Thr Thr 










440 








445 


450 


Arg 


He 


His 


Glu 


Leu 


Lys 


Gin 


Ala 


Lys Val 


Ser Thr Val Met Asp 








455 






460 


465 


Thr 


Val 


Gly 


Arg 


Arg 


Leu 


Arg 


Asp 


Ser Leu 


Gly Arg Thr Val Asp 










470 








475 


480 


Leu 


Gly 


Leu 


Thr 


Gly 


Asn 


His 


Gly 


Val Val 


His Pro Ser Leu Ala 










485 








490 


495 


Asp 


Ser 


Ala 


Asn 


l y s 


Phe 


Glu 


Glu 


Asn Thr 


Tyr Cys Phe Gly Arg 










500 








505 


510 


Thr 


Val 


Glu 


Thr 


Leu 


Leu 


Leu 


Arg 


Phe Gly 


Lys Thr He Met Glu 










515 








520 


525 


Glu 


Gin 


Leu 


Val 


Leu 


Lys 


Arg 


Val 


Ala Asn 


He Leu He Asn Leu 










530 






535 


540 


Tyr 


Gly 


Met 


Thr 


Ala 


Val 


Leu 


Ser 


Arg Ala 


Ser Arg Ser He Arg 
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545 550 555 

lie Gly Leu Arg Asn His Asp His Glu Val Leu Leu Ala Asn Thr 

560 565 570 

Phe Cys Val Glu Ala Tyr Leu Gin Asn Leu Phe Ser Leu Ser Gin 

575 580 585 

Leu Asp Lys Tyr Ala Pro Glu Asn Leu Asp Glu Gin He Lys Lys 

590 595 600 

Val Ser Gin Gin He Leu Glu Lys Arg Ala Tyr He Cys Ala His 

605 610 615 
Pro Leu Asp Arg Thr Cys 

620 



<210> 25 






<211> 245 






<212> PRT 






<213> Homo sapiens 


<220> 






<221> misc_feature 


<223> Incyte ID 


No: 


<400> 25 






Met Gly Arg 


Leu Asp 


1 




5 


Gin Gly He 


Gly Gin 






20 


Ala Lys Val 


He 


Ala 




35 


Leu Glu Lys 


Tyr 


Pro 






50 


Lys Lys Lys 


Gin 


He 






65 


Asp Val Leu 


Phe 


Asn 




80 


Leu Asp Cys 


Glu 


Glu 






95 


Val Arg Ser 


Met 


Tyr 






110 


Leu Ala Gin 


Lys 


Ser 






125 


Ser Ser Val 


Lys 


Gly 






140 


Lys Ala Ala 


Val 


He 




155 


He Gin Gin 


Gly 


He 






170 


Asp Thr Pro 


Ser 


Leu 




185 


Glu Glu Ala 


Arg 


Asn 






200 


Phe Ala Thr 


Ala 


Glu 






215 


Ser Asp Glu 


Ser 


Ala 






230 


Gly Gly Trp 


Ser 


Leu 






245 



5596281CD1 



Gly 


Lys Val He 


He 






10 


Ala 


Ala Ala Leu 


Ala 






25 


Thr 


Asp He Asn 


Glu 






40 


Gly 


He Gin Thr 


Arg 






55 


Asp 


Gin Phe Ala 


Ser 




70 


Val 


Ala Gly Phe Val 






85 


Lys 


Asp Trp Asp 


Phe 






100 


Leu 


Met He Lys 


Ala 






115 


Gly 


Asn He He 


Asn 




130 


Val 


Val Asn Arg Cys 






145 


Gly 


Leu Thr Lys 


Ser 






160 


Arg 


Cys Asn Cys 


Val 






175 


Gin 


Glu Arg He 


Gin 






190 


Asp 


Phe Leu Lys 


Arg 






205 


Glu 


He Ala Met 


Leu 






220 


Tyr 


Val Thr Gly Asn 






235 



Leu Thr Ala Ala Ala 
15 

Phe Ala Arg Glu Gly 
30 

Ser Lys Leu Gin Glu 
45 

Val Leu Asp Val Thr 
60 

Glu Val Glu Arg Leu 
75 

His His Gly Thr Val 
90 

Ser Met Asn Leu Asn 
105 

Phe Leu Pro Lys Met 
120 

Met Ser Ser Val Ala 
135 

Val Tyr Ser Thr Thr 
150 

Val Ala Ala Asp Phe 
165 

Cys Pro Gly Thr Val 
180 

Ala Arg Gly Asn Pro 
195 

Gin Lys Thr Gly Arg 
210 

Cys Val Tyr Leu Ala 
225 

Pro Val He He Asp 
240 



<210> 26 

<211> 159 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 5731013 

<400> 26 



Met Thr 


Ala Arg 


Gly 


Thr 


Pro 


Ser Arg 


Phe 


Leu Ala Ser Val 


Leu 


1 






5 








10 




15 


His. Asn 


Gly Leu 


Gly 


Arg 


Tyr 


Val Gin 


Gin 


Leu Gin Arg Leu 


Ser 








20 








25 




30 


Phe Ser 


Val 


Ser 


Arg 


Asp 


Gly 


Ala Ser 


Ser 


Arg Gly Ala Arg 


Glu 








35 








40 




45 


Phe Val 


Glu Arg 


Glu 


Val 


He 


Asp Phe 


Ala 


Arg Arg Asn Pro 


Gly 








50 






55 




60 


Val Val 


lie Tyr 


Val 


Asn 


Ser 


Arg Pro 


Cys 


Cys Val Pro Arg 


Val 








65 








70 




75 


Val Ala 


Glu Tyr 


Leu 


Asn 


Gly 


Ala Val 


Arg 


Glu Glu Ser He 


His 








80 








85 




90 


Cys Lys 


Ser 


Val 


GlU 


Glu 


He 


Ser Thr 


Leu 


Val Gin Lys Leu 


Ala 






95 








100 




105 


Asp Gin 


Ser Gly 


Leu 


Asp 


Val 


He Arg 


He 


Arg Lys Pro Phe 


His 








110 








115 




120 


Thr Asp 


Asn 


Pro 


Ser 


He 


Gin 


Gly Gin 


Trp 


His Pro Phe Thr 


Asn 






125 








130 




135 


Lys Pro 


Thr 


Thr 


Phe 


& r~rf 
-TiJw y 


r;lv 


T ,01 1 Arrr 


Pro 


Arrr Gin Val Gin 


Asp 








140 








145 




150 


Pro Ala 


Pro 


Ala 


Gin 


Val 


Gin 


Ala Gin 














155 














<210> 27 


















<211> 291 


















<212> PRT 


















<213> Homo sapiens 














<220> 




















<221> misc_feature 














<223> Incyte ID 


No: 


5731162CD1 








<400> 27 


















Met Ala 


Cys 


Leu 


Ser 


Pro 


Ser 


Gin Leu 


Gin 


Lys Phe Gin Gin 


Asp 


1 




5 








10 




15 


Gly Phe 


Leu 


Val 


Leu 


Glu 


Gly 


Phe Leu 


Ser 


Ala Glu Glu Cys 


val 






20 






25 




30 


Ala Met 


Gin 


Gin 


Arg 


He 


Gly 


Glu He 


Val 


Ala Glu Met Asp 


Val 








35 








40 




45 


Pro Leu 


His 


Cys 


Arg 


Thr 


Glu 


Phe Ser 


Thr 


Gin Glu Glu Glu 


Gin 






50 








55 




60 


Leu Arg 


Ala 


Gin 


Gly 


Ser 


Thr 


Asp Tyr 


Phe 


Leu Ser Ser Gly 


Asp 








65 








70 




75 


Lys lie 


Arg 


Phe 


Phe 


Phe 


Glu 


Lys Gly 


Val 


Phe Asp Glu Lys 


Gly 








80 








85 




90 


Asn Phe 


Leu 


Val 


Pro 


Pro 


Glu 


Lys Ser 


He 


Asn Lys He Gly 


His 








95 








100 


105 


Ala Leu 


His 


Ala 


His 


Asp 


Pro 


Val Phe 


Lys 


Ser He Thr His 


Ser 








110 








115 




120 


Phe Lys 


Val 


Gin 


Thr 


Leu 


Ala 


Arg Ser 


Leu 


Gly Leu Gin Met 


Pro 








125 








130 




135 


Val Val 


Val 


Gin 


Ser 


Met 


Tyr 


He Phe 


Lys 


Gin Pro His Phe 


Gly 








140 






145 




150 


Gly Glu 


Val 


Ser 


Pro 


His 


Gin 


Asp Ala 


Ser 


Phe Leu Tyr Thr 


Glu 








155 








160 




165 


Pro Leu 


Gly 


Arg 


Val 


Leu 


Gly 


Val Trp 


He 


Ala Val Glu Asp 


Ala 








170 








175 




180 


Thr Leu 


Glu 


Asn 


Gly 


Cys 


Leu 


Trp Phe 


He 


Pro Gly ser His 


Thr 








185 








190 




195 
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Ser Gly 
Pro Gly 
Leu Phe 
His Gly 
Ser Arg 
Thr Trp 
Phe Pro 



Val Ser Arg 
200 

Thr Ser Phe 
215 

Val Pro Thr 
230 

Glu Val Val 
245 

Gin Ala Tyr 
260 

ser Pro Glu 
275 

Gin Leu Tyr 
290 



Arg Met Val Arg Ala 
205 

Leu Gly Ser Glu Pro 
220 

Pro Val Gin Arg Gly 
235 

His Lys Ser Lys Gin 
250 

Thr Phe His Leu Met 
265 

Asn Trp Leu Gin Pro 
280 

Thr 



Pro Val Gly Ser Ala 
210 

Ala Arg Asp Asn Ser 
225 

Ala Leu Val Leu lie 
240 

Asn Leu Ser Asp Arg 
255 

Glu Ala Ser Gly Thr 
270 

Thr Ala Glu Leu Pro 
285 



<210> 28 

<211> 1557 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 543496CB1 

<400> 28 

ctgagtgcga cggcgcaggt ctgcaccatg 
gtgcgtgcag ctccccacag cggcccgctg 
acagacaccg tgtatgacgt ggtggtgtcg 
tgtgccttgg gatatgatat tcactttcat 
ccaaagaaag tactggagaa attgtcagaa 
cctggctctg caacgcttct cagtagtttt 
tacagagcct ttcggcgaat gcaggtgtgg 
gataaggata atttagatga catgggctat 
ctcactaagc agttggaggc tgtgtctgac 
attcgctata cctggccttg tccatttcct 
accctaggtg atggcagcac cttccagacc 
tccggagtac ggcaggctgt tggaatccag 
gttgtggcta ctctgcattt atcagaggcc 
cttccctctg ggcctattgc tctgctcccg 
tccacgtccc atgaacatgc agcagagcta 
gccgttaact ctgccttttg gagtgatgct 
gccatgctgc agtatgctgt cagccttctg 
cccccaagcg tagccagggt ggatgccaaa 
catgctgctg agtacgtcag gcctcgggtg 
catccgcttg caggacaggg tgtcaacatg 
cacctcagta cggcagcctt caatgggaag 
tatgaaacag aaagacagcg tcacaacact 
aggctctatt ctaccagtgc ctccccgctt 
acaaatgcag tgtctccact caaagaacag 
ctctcctaaa gaaagattac gttgatgaaa 
tattttcaag atcttattta atttaataaa 



gcggcccggc ttgtcagccg atgcggggct 60 
gtgtcctggc gcaggtggtc cggcgcctca 12 0 
ggtggaggcc tggtgggcgc tgccatggcc 180 
gacaagaaaa tcctgttgct cgaagcaggt 240 
acttacagca acagggtcag ctccatttcc 3 00 
ggtgcctggg accatatctg caacatgaga 3 60 
gacgcctgct cagaggccct gataatgttt 420 
atcgtggaga atgatgtcat catgcatgct 480 
cgagtgacgg ttctctacag gagcaaagcc 540 
atggccgact ccagcccttg ggttcatatt 600 
aaattgttga taggtgcaga tggtcacaac 660 
aatgtgagct ggaactatga ccagtctgct 720 
acagaaaaca acgtagcctg gcagagattt 780 
ctctcagaca ccttgagttc cttggtttgg 840 
gttagcatgg atgaggaaaa atttgtggat 900 
gaccacacgg acttcatcga cacagctggt 960 
aagcccacta aggtctcggc tcgccagctg 1020 
agccgagttc tgtttcctct tgggttggga 1080 
gcgctcattg gggatgcagc ccacagagtc 1140 
ggctttgggg atatctccag cttggcccat 12 00 
gacttaggtt ccgtgagcca cctcacaggt 12 60 
gctcttctgg ctgctacaga cttactaaaa 1320 
gtgttgctca ggacgtgggg cttgcaggcc 1380 
attatggcct ttgcaagcaa atgagtactc 1440 
aagaacatcc tgcccaggac ccatcataca 1500 
cttactttac attaaaaaaa aaaaaaa 1557 



<210> 29 

<211> 1106 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 907607CB1 

<400> 29 

cccacgcgtc cgggttcccc ggcctctctt ggtcagggtg acgcagtagc ctgcaaacct 60 
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cggcgcgtag gccaccgcac ttatccgcag 
cttcccagtg cccgcccagc taccggccag 
gtcagggaga cccttaggca ctccggacta 
gagctgcggc tgttgccgcc gggctgcgca 
acaccattaa gaaacagcct ctgcatcagt 
cagcctttta tcacccagtg agatacatgt 
acagcttaaa gtttatacca ggaaaaccag 
ccccagctgc agcatttcgc tcccctctgg 
aaagtgtctt ctttggacca gatttcatca 
ggaatttact gaaaccagat atttatgcaa 
ccctggttac tgaggaaaca cc ttcaggag 
tggcaatgat taaggaattg ttagatacta 
gggatgtaat ctacaaaggc tttgaagatg 
gtaccagctg ccctagttca atcattactc 
tttatattcc ggaggtagaa ggcgtagaac 
aagcaaactc accttaaaat aatctggatt 
taatatatat caagttttta ttattaatat 
ctcttcagag aatgatatat aaaaaa 



caggaccgcc cgcagccggt agggtgggct 120 
cctgcggctg cgcagatctt tcgtggttct 180 
agatggcggc gacggccagg cggggctggg 240 
ggcggttctg tcatatgttg aagaatccat 300 
ttgtacaaag accacttttc ccactacctg 3 60 
ttattcaaac acaagatacc ccaaatccaa 420 
ttcttgagac aaggaccatg gattttccca 480 
ctaggcagtt atttaggatt gaaggagtaa 540 
ctgtcacaaa ggaaaatgaa gaattagact 600 
caatcatgga cttctttgca tctggcttac 660 
aagcaggatc tgaagaagat gatgaagttg 720 
gaatacggcc aactgtgcag gaagatggag 780 
gcattgtaca gctgaaactc cagggttctt 840 
tgaaaaatgg aattcagaac atgctgcagt 900 
aggttatgga tgatgaatca gatgaaaaag 960 
ttctttgggc ataacagtca gacttgttga 1020 
gctgaggaac ttgaagatta ataaaatatg 1080 

1106 



<210> 30 

<211> 2180 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1290078CB1 

<400> 30 

gccccacaca ggccgcggcg gctggctcgg 
ggaagccagg cggctggcgg aggaggagag 
gccgcagact tacttccccg gctcagcagg 
tatgcaaagt tgtgaatcca gtggtgacag 
gagaagggga cagcctcgtg tggtggtgat 
caaagcactt cttgagcagg gtttcacgga 
cggaggccgt gtgcagagtg tgaaacttgg 
gatccatggc tcccatggga accctatcta 
agagacaacc gatggggaac gcagcgtggg 
ggcctgctac cttaccaacc acggccgcag 
cgatttatac aacgaggtct ataacttgac 
caatgctgaa agtcaaaata gcgtgggggt 
caggaatgac cctgacgacc cagaggctac 
gtacctgaag gtggagagct gtgagagcag 
cgccttcggg gagtggaccg agatccccgg 
gcgggttgtg gagctgctgg cggagggcat 
tgtccgctgc attcactggg accaggcctc 
ccggggtgag ggcgaccaca atcacgacac 
ccgggggggc aggtgggatg aggatgagca 
tgagctgatc ccggcggacc atgtgattgt 
gtacaccagt ttcttccggc caggcctgcc 
gggcattggc accaccgaca agatctttct 
gtgcaacagc ctacagtttg tgtgggagga 
acctgagctc tggtaccgca agate tgcgg 
cggccatgtg ctgagegget ggatctgegg 
tgacgaggca gtggccgaga tetgeaegga 
cattccaaaa cctcggcgaa tcttgcgctc 
ctcctattca tacaegcagg tgggctccag 
cctgccgtac aeggagaget caaagacagc 
cacccaccgc aagtactatt ccaccaccca 
tgcccgcctc attgagatgt accgagacct 
getgetgaga agagecacta actcgtgacc 
tgccttcctg atcctctgta gaaaggattt 
gccttcagac ctggccctgt agcttttctt 
ccgttgagtt acctctgtgc tggatcccgt 
tgttattgta agtgccttca atactttgea 
tgcccctcaa aaaaaaaaaa 



gcccctacgg tcccggcggc ggctggagga 60 
aeggaggagg ccgagaccgg agcgccgctc 120 
gaaaggttcc tagaaggtga gegeggaegg 180 
tgeggatgae cctctcagtc gcggcctacg 240 
cggcgccggc ttggctggcc tggctgeage 3 00 
tgtcactgtg cttgaggctt ccagccacat 3 60 
acacgccacc tttgagctgg gagccacctg 420 
tcatctagca gaagccaacg gcctcctgga 480 
ccgcatcagc ctctattcca agaatggcgt 540 
gatccccaag gacgtggttg aggaattcag 600 
ccaggagttc ttccggcacg ataaaccagt 660 
gttcacccga gaggaggtgc gtaacegcat 720 
caagcgcctg aagctcgcca tgatccagca 780 
ctcacacagc atggacgagg tgtccctgag 840 
cgctcaccac atcatcccct egggcttcat 900 
ccctgcccac gtcatccagc tagggaaacc 960 
agcccgcccc agaggecctg agattgagee 1020 
tgggg a gggt ggccagggtg gagaggagee 1080 
gtggtcggtg gtggtggagt gcgaggactg 1140 
gaccgtgtcg ctaggtgtgc taaagaggca 1200 
cacagagaag gtggctgcca tccaccgcct 1260 
ggaattcgag gagcccttct ggggccctga 1320 
egaageggag agccacaccc tcacctaccc 1380 
ctttgatgtc ctctacccgc ctgagegcta 1440 
ggaggaggee ctegtcatgg agaagtgtga 1500 
gatgetgegt cagttcacag ggaaccccaa 1560 
ggcctggggc agcaaccctt acttccgcgg 1620 
eggggeggat gtggagaagc tggccaagcc 1680 
gcccatgcag gtgctgtttt ccggtgaggc 1740 
cggtgctctg ctgtccggcc agegtgagge 1800 
cttccagcag gggacctgag ggctgtcctc 1860 
tccagcctgc cccttgctgc cgtgtgctcc 1920 
ttatcttctg tagagctagc cgccctgact 1980 
tttctccagg ctgggccgtg agcaggtggg 2040 
gcccccactt gcctaccctc tgtcctgcct 2100 
ttttgggata ataaaaaagg ctccctcccc 2160 

2180 
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<210> 31 

<211> 1311 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1302741CB1 

<400> 31 

ccgccgccca cccgagccta tctgggctgc 
cgccccaatc cttgcgtgtg cttgcagtcc 
ccagtctccg acttccattt cccaccctaa 
gttgtcctcg ccctgctgcg ctgagtgtcc 
acgctgcaga gctcgtgggt gaccttccgc 
agtctggctt tcgtctacgg ctccggggtg 
aagaatgcta tgctggactt tgtgttcaca 
aacctgaaga aaaattggag tcactactct 
acgtccatcc agaataacta tggcgctgga 
ggtaggctta tcaaatatgg agttattagc 
tggaataact tatacattgc tggacgactc 
gaggatgtca ctcttagatc agccctcgat 
ttcctcatgc tccccgaaag cttttctgaa 
tcctattcag gtgactttcg gatggtggtt 
gtgaagccca atatagccca ctttcgagag 
caagtggtgt ataaaagcca gcaaggctgg 
ttcactcagc tgatgacatt gcccaaaacc 
cctcctggaa aaaacagaga tgtggaagaa 
tgtggagatg tggtgcgact agggctttca 
agcacgaaag gcatttttac tgctggcctg 
ctgcacaaaa tgtggaaagg gtggctgagg 
atgttatatg tagatgaata aagtgtttga 



gtcttctcgc cgctgctctt cgtggcccaa 60 
caccccacac tcagccttgt gtccctcgat 120 
accgcctacc cggtgtctgt tccccgcccg 180 
cctgttagcc tcgaccccat ggcgctgcag 240 
aagatcctgt ctcacttccc cgaggagctg 300 
taccgccagg cagggccgag ttcagaccag 3 60 
gtagatgacc ctgtcgcatg gcattcaaag 420 
ttcctaaaag ttttagggcc caagattatc 480 
gtttactaca attcattgat catgtgtaat 540 
actaacgttc tgattgaaga tctcctcaac 600 
caaaaaccgg tgaaaattat ctcagtgaac 660 
agaaatctga agagtgctgt gaccgctgct 720 
gaagacctct tcatagagat tgccggtctc 780 
ggagaagata aaacaaaagt gttgaatatt 840 
ctctatggca gcatactaca ggaaaatcct 900 
ctggagatag ataaaagccc agaaggacag 960 
ttacagcaac agataaatca tattatggac 1020 
actttattcc aagtggctca tgatcccgac 1080 
gcaatcgtga gaccgtctag tataagacag 1140 
aagaagtcag tgatttatag ttcactaaaa 1200 
aaaacatcct gattttgctt gcttttatat 1260 
tcctttttga caaaaaaaaa a 1311 



<210> 32 

<211> 921 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1541028CB1 

<400> 32 

ggcccctgcg ccccacgtgg cgccctttcc 

gacaactctc ccccctcccc cggcacccgg 

gggtggctga gctgagaggg ctccgggaag 

tttgtgccgc gggacgccca gcggcacgac 

attttgcggg acttcggggc tgtgagctgg 

ggctggtccc cacctctggc ccccgggtgc 

tgccaaccca gcaactgccg ccggtgcccg 

agctttatga gtgccaacac cttcggcaat 

cggctcgagg gccgctgtgg gcccaccaat 

atccctcctg gctgtgagct ggtggtcggc 

tgtctactgg tggacgactc ttttctacac 

gggcctcgag tggtcttcat cgtggacctc 

caggccctcg actttgtctt cgccccagac 

caggctggag agacactgcg ctcagggacg 

tgcgggggtg ggcgggggcg gaggatggga 

tgaatcctca aaaaaaaaaa a 



gggcgggggg ggtggggagg cacctccagc 60 
ctgatttgct gtgccactgg gagggttcgg 120 
gagtgacgtc aggacctgcc ttcagccccc 180 
gtggagctcc tggagagcag cttccctgcc 240 
gacttctcag ggactacccc tccgcctcgg 300 
taccagctcc tgctgtacca agcaggccgg 360 
ggggcctatc gggcactgag ggggcttcga 420 
gccggctttt ccgttctcct gcctggggcc 480 
gcccgggtca gatgccatct gggcctaaag 540 
ggtgagcccc agtgctgggc tgaggggcac 600 
acagtggctc acaatggctc ccccgaagat 660 
tggcacccca acgtggcagg ggctgagcgc 720 
ccttgaagga aggtgctccc ttcacacacc 780 
gcttgatggt agccaggacc tcctctctac 840 
actggctagt gagcactgaa atataaattc 900 

921 



<210> 33 

<211> 2032 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 1597687CB1 

<400> 33 

agaaaccagg cggcgcgttc ccggtggcgg cgccctggac tcccgggccc gcgcatcccc 60 
gccagccttc cttaaggcgg atgggtggcc cccgagaccc cgtcggaccc atggtttcca 120 
gtgcagcgcg gagtgggcga tgccagcgtg ccaggagcca tgtctgacca ggacgtttgg 180 
aagatcatat ccatgccaga ggctcttgtg aggagatgag ttggtaaaga gagaggctgg 240 
gatgaagatg ctgccaggag tgggcgtgtt tgggactggc agctccgccc gagttctggt 300 
cccactgctg agggcagaag ggttcactgt tgaggccctg tgggggaaga ctgaggagga 3 60 
ggcgaagcag cttgctgagg agatgaacat cgccttctac accagccgga ctgatgacat 420 
cttgctgcat caagatgtgg atctggtgtg catcagcatc ccccctccac tcacccggca 480 
gatatccgtg aaggctctag gtattgggaa gaatgtggtt tgcgagaagg cagcaacatc 540 
ggtggatgcc ttccggatgg tgacagcctc gcgctactac ccgcagctca tgagcctggt 600 
agggaacgtg ctgcgcttcc tgcctgcctt cgtgcgcatg aaacagctga tttcggaaca 660 
ctatgtggga gcggtgatga tctgtgatgc ccgcatctac tcaggcagcc tgctgagccc 720 
cagctatggc tggatctgtg atgagctcat gggcggcggg ggcttgcaca ccatggggac 780 
ctacattgtg gacctgctga cccacctgac cggccggaga gccgagaagg tgcacgggct 840 
gctcaagaca ttcgtgaggc agaacgctgc catccgtggc atccggcacg tcactagcga 900 
tgacttctgt ttcttccaga tgctcatggg tgggggtgtg tgtagcacag tgacactcaa 960 
cttcaacatg ccaggcgcct ttgtgcatga agtcatggtg gtaggctctg caggacgcct 102 0 
cgtcgcccgg ggagccgacc tctatgggca gaagaactct gccacgcaag aggagctgct 1080 
cttgagggac tcgctggcag tgggcgcagg actgcctgag caggggcccc aggatgtccc 1140 
gctgctgtac ctgaagggca tggtctacat ggtgcaggcc ttgcgccagt ccttccaggg 1200 
gcagggcgac cgccgcacct gggaccgcac ccctgtctcc atggccgcct ccttcgagga 1260 
tgggctgtac atgcagagcg tggtggatgc catcaagagg tcgagccgat ccggggagtg 1320 
ggaggctgtg gaggtgctga cggaggagcc cgacaccaac cagaacctgt gtgaggcact 1380 
tcagcggaac aacctatgag cctgcacctg ggctccttgc cacagggcag agggaccagg 1440 
gaggggaaca ggagccagac atgacaggga cttggcccta gcagagacag tgtctttcat 1500 
ttaatgagtc tgggtgaagc cagaggtggc tctgggactt cgccccctcc agtggctcat 1560 
ctgctcctca gacttgcagg gtggcaaatg ttcctgttgc caggttacca cggtgtgcac 1620 
agggcagagc tgccgccagg cctcctcatg gtgatgccag tgagtcggac caggaaaggc 1680 
tcaccctggg gagcccttca gcctgatccg cagtcctaac agggtgagaa agcacaagag 174 0 
gggccagctc ggctgaggcc tgaggagaaa tgatgagaag ttttgtttcg ttcttgggct 1800 
ggggcatctc aggaacagtg agggcaacaa gctgtccctc agggaccctc acctgccctc 1860 
ccatgtgacc aggcccttcc caggtgggga ctggggcttg ctttattaag tgagctcttg 1920 
ggcttttgaa gtaggacaga gggccctggt ttggcagaga ggaggcaagg gagggcttta 1980 
gaacatctga gaaatgttaa taaataattc agaaaataaa aaaaaaaaaa aa 2032 



<210> 34 

<211> 1134 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1690348CB1 

<400> 34 

cgtggacttc taagaaagcg 
gtgctctgtt ggagccatcc 
gcgccatctt actctaacca 
aactgcaagt gtccttcgac 
tgagtcaccg tcagtgaaga 
agctggccag tgggttgatt 
atgctccagt cccagactgc 
gaaccaccct cctgccctct 
gagagtgggt ggagagttct 
gttgattgca ggaggagtcg 
tctcctcaga gagcaggcaa 
ctacagtgca aaaaatacca 
tgaatttcct gagaagattg 
tgcggaactc aagccataca 
tatttcaaaa gagactttgt 
caagcaactg gaaaacaacc 



ccatggcctg tgctgctgtt 
gtattgaggc tgcgtcactg 
gcataatgaa atccaaaagg 
gggagattgt ggcagcagct 
gcctccgctt gcttgttgct 
tctttattcc aggagtctct 
tagaacaaga gagagtgata 
gggttcacaa tacgtgtaca 
tctttgaccc tcagcctgcg 
gaattaaccc tctgctttcc 
acaaaagaaa tggatatgag 
gcgaactcct gtttaagaaa 
catgcagttt gcatgttaca 
tcacggaagg aagaataacg 
tctatatttg tggcccacct 
atgtacccaa agaacacatt 



atgattcctg ggttgttgcg 60 
agattgacac tcagcacttt 120 
aaaactgatc acatggagag 180 
aaggtgtgtg gagctgccag 240 
gatcaagact tttcctttaa 300 
gtggttggtg ggttttcaat 360 
gaattggcag tgaaatatac 420 
cttgactgtg aagtggctgt 480 
gatgcctcta gaaacctcgt 540 
atcctgcggc acgcagcaga 600 
ataggaacaa taaaactatt 660 
aatatccttg atttagtaaa 720 
aaacagacta cacaaatcaa 780 
gagaaggaga taagagatca 840 
ccaatgacag actttttctc 900 
tgctttgaga agtggtggta 960 
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ggaggcagac aaaggcagaa aaaataaaga ggtgagatct actcaggaga gctcctgtcc 1020 
tttgtggcat gattaatttt ttttatctct acttgagttg tcttattttt taaggctata 1080 
aacttagtga ccagctggat aataaaagcc agctggcaga cttaaatgat aaac 1134 



<210> 35 

<211> 734 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 661 

<223> a or g or c or t, unknown, or other 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1865603CB1 

<400> 35 

gggaggaccc tgggcaaaga 
ccagtcaaga tgtggggcag 
cctcccccat gtcttatccg 
gacaccacca tgttttattc 
cggaaagcac tgtgttttgg 
gaacccaaag ccgctcaccc 
cctttggagg aaatgatcca 
gtccccagaa caggggcaaa 
aatctgattg aggtgtccaa 
gtcccccttg atgtcgccct 
ctccaaagac tgaggactta 
nttgggacca aacctacatg 
ctcaatgaaa aaaa 



<210> 36 
<211> 2221 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1976472CB1 

<400> 36 

gtcgggggct tggcctctgc ccggccacag agccggagct ggaggtgctg tcccgtctgg 60 
cggcgatccc cgggcagggc ccggggccgg ggtttgaaga tgctgaacgt cccttcccag 120 
tctttcccgg cccccaggtc gcagcagcgt gtcgcctccg gggggcgtac aaggtacctt 180 
taaaacaggg cagaagcctt atggattgga ttcgactgac caaaagtgga aaggatctaa 240 
cgggattaaa aggcaggtta attgaagtaa ctgaagaaga acttaagaaa cacaacaaaa 300 
aagatgattg ttggatatgc ataagaggtt tcgtttataa tgtcagccct tatatggagt 360 
atcatcctgg tggagaagat gaactaatga gagcagcagg atcagatggt actgaacttt 420 
ttgatcaggt tcatcgttgg gtcaattatg aatccatgct gaaagaatgc ctggttggca 480 
gaatggccat taaacctgct gttctgaaag actatcgtga ggaggaaaag aaagtcttaa 540 
atggcatgct tcccaagagc caagtgacag atacacttgc caaagaaggt cctagttatc 600 
caagctatga ttggttccaa acagactctt tagtcaccat tgccatatat actaaacaga 660 
aggatatcaa tttagactca ataatagttg atcatcagaa tgattccttt agagcagaaa 720 
caattattaa ggattgttta tatcttatac atattgggct aagccatgag gttcaggaag 780 
atttttctgt gcgggttgtt gagagtgtgg gaaaaataga gattgttcta caaaaaaaag 840 
agaatacttc ttgggacttt cttggccatc ccctgaagaa tcataattca cttattccaa 900 
ggaaagatac aggtttgtac tacagaaagt gccagttaat ttccaaggaa gatgttactc 960 
atgatacgag gcttttctgt ttgatgctgc caccaagcac tcatcttcaa gtgcccattg 1020 
ggcaacatgt ttacctcaag ctacctatta caggtacaga aatagtaaag ccatatacac 1080 
ctgtatctgg ttccttactc tcagagttca aggaaccagt tcttcccaac aataaataca 1140 
tctacttttt gataaaaatc tatcccactg gactcttcac accagagctt gatcgtcttc 1200 
agattggaga ttttgtttct gtaagcagtc ctgagggcaa ttttaaaata tccaagttcc 1260 
aagaattaga agatctcttt ttgttggcag ctggaacagg cttcacacca atggttaaaa 1320 



cacctaccct gccatgctgc 
gactttggag aaacagtcat 
tagacttgac cacatcgtga 
caagatcctg ggcatggagg 
agaccagaaa tttaacctcc 
agttcctggc tccctggaca 
gcacctcaag gcttgtgatg 
agggcctatc atgtccatct 
ctacatctcc tcgtgatgga 
ctcctttcct tcctgcaaac 
ggcacttcac acatcctgct 
tctgtatgtc atcaaagttg 



gccatctgcc ctccaggctg 60 
ggagggacag cagtcagacc 120 
tgacggtgaa gagcatcaaa 180 
tcatgacttt taaggaagac 240 
acgaggtggg aaaggaattt 300 
tatgtctgat cacagaggtg 3 60 
tccctattga ggaggggcca 420 
acttccgaga ccccgacaga 480 
ggctggacct cctccattct 540 
ccccacccag gcccagagat 600 
gaggggggac ccaagacagg 660 
gcctaataaa tgcataacct 720 

734 
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tactgaatta tgctttgact gatataccca 
ataaaacaga agatgatata atttggagaa 
aaagactgga tgttgaattt gttctctcag 
gacatatttc accagctctt ctttctgaat 
ttctcgtctg catttgtgga ccagtgccat 
atctcaactt ttccaaaaat gagatccata 
tcctttattc aactagttta tctaaatttg 
tttgtacata acaaaaggtt aactagaatc 
gttccttcag tacaggtaac ttcttggctt 
atatttgacc tggaaagtta atcatggcaa 
cacaaatttc cttgccattt aaattatatc 
gacatgataa agtaaatgat cacttcgatc 
caatacagat taaattttac cttgcactgt 
ttattaatgt taaaacatag aatgttataa 
tgtgtatggc attgatgcag aatagaataa 
a 



gtctcaggaa agtgaagctg atgttcttca 1380 
gccaattgga gaaattagca tttaaagata 1440 
cacctatttc tgaatggaat ggcaaacagg 1500 
ttttgaaaag aaatttggac aaatccaaag 1560 
ttacagaaca aggagtaagg ttgctgcatg 1620 
gttttacagc ataatgaaga gctgtcattg 1680 
tgattgctta gggtttttta agagaacatt 1740 
cagccttcag tttcttaaat gaaatcaaat 1800 
tcttttgtac cacaacttat tttactactg 1860 
caaatacata caggattctt tgttatgaat 1920 
actgttctac aataagcact tgtgttttat 1980 
atgtttctat gctgtaaaat gtcttattag 2040 
taactcagga aatgatcatt tatgtccttg 2100 
cttattaagt gatccaaaca tttttttgtg 2160 
aattatactt aagttctttt taaaaaaaaa 2220 

2221 



<210> 37 

<211> 1706 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2050821CB1 

<400> 37 

agaagaggtt gtgtgggaca agctgctccc 
tggctgggcc tcagaccggt ggcaatgtcc 
tcctggctac tcgcccgcat cctggcttgg 
ctccagtgtt tcccacagcc cccaaaacgg 
actcctacag aggagggctt gaaggactcg 
tttacggtat ggctgggtcc catcatcccc 
cggtctatca ccaatgcctc agctgccatt 
ctgaagccct ggctgggaga agggatactg 
cgtcggatgc tgacgcccgc cttccatttc 
aacaagagtg caaacatcat gcttgacaag 
cgtctggaca tgtttgagca catcagcctc 
ttcagctttg acagccattg tcaggagagg 
ctcagtgccc ttgtagagaa aagaagccag 
tacctctccc atgacgggcg gcgcttccac 
gacgctgtca tccgggagcg gcgtcgcacc 
aaagacaaag ccaagtccaa gactttggat 
gaagatggga aggcattgtc agatgaggat 
ggaggccatg acaccacggc cagtggcctc 
ccagaatacc aggagcgctg ccgacaggag 
aaagagattg aatgggacga cctggcccag 
agcctgaggt tacatccccc agctcccttc 
ctcccagatg gccgagtcat ccccaaaggc 
catcacaacc caactgtgtg gccggatcct 
gagaacagca aggggaggtc acctctggct 
tgcatcgggc aggcgttcgc catggcggag 
cacttccggt tcctgccaga ccacactgag 
gccgagggcg ggctttggct gcgggtggag 
gacccatcca cctgtttttt tgcagattgt 
aaaaaaaaac taaaaaaaaa aaaaaa 



gacagaagga tgtcgctgct gagcctgccc 60 
ccatggctac tcctgctgct ggttgtgggc 120 
acctatgcct tctataacaa ctgccgccgg 180 
aactggtttt ggggtcacct gggcctgatc 240 
acccagatgt cggccaccta ttcccagggc 300 
ttcatcgttt tatgccaccc tgacaccatc 360 
gcacccaagg ataatctctt catcaggttc 420 
ctgagtggcg gtgacaagtg gagccgccac 480 
aacatcctga agtcctatat aacgatcttc 540 
tggcagcacc tggcctcaga gggcagcagt 600 
atgaccttgg acagtctaca gaaatgcatc 660 
cccagtgaat atattgccac catcttggag 720 
catatcctcc agcacatgga ctttctgtat 780 
agggcctgcc gcctggtgca tgacttcaca 840 
ctccccactc agggtattga tgattttttc 900 
ttcattgatg tgcttctgct gagcaaggat 960 
ataagagcag aggctgacac cttcatgttt 1020 
tcctgggtcc tgtacaacct tgcgaggcac 1080 
gtgcaagagc ttctgaagga ccgcgatcct 1140 
ctgcccttcc tgaccatgtg cgtgaaggag 1200 
atctcccgat gctgcaccca ggacattgtt 1260 
attacctgcc tcatcgatat tataggggtc 1320 
gaggtctacg accccttccg ctttgaccca 1380 
tttattcctt tctccgcagg gcccaggaac 1440 
atgaaagtgg tcctggcgtt gatgctgctg 1500 
ccccgcagga agctggaatt gatcatgcgc 1560 
cccctgaatg taggcttgca gtgactttct 1620 
catgaataaa acggtgctgt caaaaaaaaa 1680 

1706 



<210> 38 

<211> 549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> incyte ID No: 2408443CB1 
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<400> 38 

cccacgcgtc cgtgtcatgt acttttgtta 
tcaaggaggg caacatttta tggttacatt 
taaagcaaaa caatggctgg ttgaccataa 
agaaccattg cgtgcggaag atgtcaaagc 
agaattaatt tcaacacgct caaaaatttt 
gtcaattaat aaattgattg acctcatcgt 
tattcttgac gatcagcgca tgcaaattgg 
accacgtgaa gttcgtcagc gagagttaat 
agaggcaaaa gatttagtgg ttgaagaagg 
tgacaaaag 



atatttttgc tataataaac ttagtatatt 60 
atactcgtct ccgagttgca cctcttgtcg 120 
tctcccattt attgaacgta atttaaataa 180 
aatgttacga ttgactgaag atgggacgga 240 
ttctgagttg acgattgact tagatgatat 3 00 
catgtatccg tctttactga agcggccaat 3 60 
gtacaatgat gatgaaattc gtcgcttttt 420 
tcgcgcaaca tttaaagctg acttcgcaga 480 
ctgagcacgt cagtcttttt ttctttacca 540 

549 



<210> 39 

<211> 1363 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2508668CB1 

<400> 39 

cgtcaggctc tgtgttggtt ggagcgagca 
gaacccccgc ccggcagtgg cgggggctgc 
cctcctccta ctccgcatcc gccgagcctg 
acggggatcc agccaaggtc gtcgaactca 
cagatgtccg tgtgaagatg ctggcggccc 
aaggaaacta cggactcctt cctgaactgc 
aggtggtagc ggtgggcagc aatgtgaccg 
caaatgctgg tttaggaacc tggcggaccg 
aagttccgag tgacatccct cttcagagcg 
cctacaggat gttgatggac ttcgagcaac 
catccaacag cggagtgggg caagcggtca 
ccatcaatgt ggtccgagac agacctgata 
tgggggctga gcatgtcatc acagaagagg 
ttaaggacat gccccagcca cggcttgctc 
agctgctgcg gcagttagcg cgtggaggaa 
agcccgtcgt agcctctgtg agcctgctca 
ggttgtccca gtggaagaag gatcacagtc 
tgtgcgatct catccgccga ggccagctca 
aggactacca gtctgccttg gaagcctcca 
tcaccatgtg atcatcccaa aagagctgga 
gctgggtgca ggcccctcag ttggggctcc 
gcctcttcct attaggagga tggtgaagcc 
ggtatctaat aaagtctgaa ctctcccttc 



tgtgggtctg cagtaccctg tggcgggtgc 60 
tcccagcttc tggctgtcac ggacctgccg 120 
cccgggtccg ggcgcttgtc tatgggcacc 180 
agaacctgga gctagctgct gtgagaggat 24 0 
ctatcaatcc atctgacata aatatgatcc 300 
ctgctgttgg agggaacgaa ggtgttgcac 3 60 
ggctgaagcc aggagactgg gtgattccag 420 
aggctgtgtt cagcgaggaa gcactgatcc 480 
ctgccaccct gggtgtcaat ccctgcacag 540 
tgcagccagg ggattctgtc atccagaatg 600 
tccagatcgc cgcagccctg ggcctaagaa 660 
tccagaagct gagtgacaga ctgaagagtc 720 
agctaagaag gcccgaaatg aaaaacttct 780 
tcaactgtgt tggtgggaaa agctccacag 840 
ccatggtaac ctatgggggg atggccaagc 900 
tttttaagga tctcaaactt cgaggctttt 9 60 
cagaccagtt caaggagctg atcctcacac 1020" 
cagcccctgc ctgctcccag gtcccgctgc 1080 
tgaagccctt catatcttca aagcagattc 1140 
gtgacatggg aggggaggcg gatctgaggg 1200 
caccttcccc agactactgt tctcctcact 1260 
agccacggtt ttccccaggg ccagccttaa 1320 
caaaaaaaaa aaa 13 63 



<210> 40 

<211> 1196 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 253 683 0CB1 

<400> 40 

ggttgccagg caaccgcgaa ccgcggtccc 
attcgtcatc gcgggtagag cagattgtcc 
ctatttacaa aagaatcttc ctgattttcg 
ttgggaggat tggctaaaag atgtgtgtga 
tatcatctgg agagagctgt tggatcgtgg 
tgagttcctg gagcatgctc agctttacta 
gatgatggta attgctcaag agaacctggg 
agccctgaaa acttgcatca accccttgca 
ctgctacaac ctaattccca tattgacgag 



tgcctctgag tctctctcga ccatggccaa 60 
a tat tat get aaaacagaac ttgtggcaga 120 
gatacataaa atcacacaac gtcctgaggt 180 
aaagaataag tggagtcaca agaattcccc 240 
aggaaagggt ttgcttttgg gaggatataa 300 
tgatgtcacc tctagcatga cgactgaact 3 60 
ggcacatata gaaaaagagc aggaggaaga 420 
ggtctggatc accagtgcct ctgctcctgc 480 
tggcgaagtg tttgggatgc atacagaaat 540 



31/40 



WO 01/44448 



PCT/US00/33158 



tagcataact 
gacccaagac 
ggccttccgc 
cactctggag 
agagaaaaat 
gaagacagtt 
gctgggggtg 
atgtgagtga 
agctaggaag 
aacagacata 
cttttgtgtc 



ctatttgaca 
ctggcatctc 
caggcccacg 
gactgcctcc 
gctcatgagt 
ttactcatga 
gaaggtgaag 
atgtctcttt 
aacactgtcc 
aagaacttat 
tggctttttc 



acaagcaggc 
ccgtcctgcg 
tcattgtggt 
gaagcagggt 
ctgtcagagt 
gatatgcccc 
cgaaagccat 
atgtttttgt 
cctctaggaa 
tgtgtgattt 
catcattata 



ggaagaacat 
cagtgtctcc 
gctggatgac 
gcctctctgc 
catcgtggga 
acgcattgca 
actggccaga 
ttttattttg 
cattctgagt 
gactactgtg 
aaaatcaata 



ctcaaaagcc 
atctgcacga 
agcaccaaca 
aggctctatg 
gggagaacct 

cacaacatta 
aaactgaaga 
aaacaacatg 
atctggaata 
caaacatttt 
aatatttaat 



ttgtggtgga 
aggtggagga 
aggaggtgtt 
ggtacctgat 
ttgtaaacct 
ttgctgtggc 
cagctccttc 
atttcaaagc 
ggaagtataa 
tttttactgg 
aagaaa 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1196 



<210> 41 
<211> 1926 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2645179CB1 

<400> 41 

gcttcctctc ctggagtcca gagtcccgct gctccgccgc gatattcagt aaaccactgg 60 

gagtccggca gcatggaggc agcgcgccct cccccgacgg cagggaagtt cgtggtggtc 120 

ggcggcggca tcgcgggcgt cacttgtgcg gagcagttgg ctactcactt tccatcggaa 180 

gatattctct tggtaacagc ttctcctgtt attaaagcag ttacaaattt caagcagatt 240. 

tctaaaatat tggaagaatt cgatgttgaa gaacaatcaa gtaccatgtt aggaaaacgc 3 00 

tttcccaaca ttaaggttat agaatctggc gtaaagcaac tgaagagtga agaacactgc 3 60 

attgtaacag aagatggcaa tcagcacgta tataagaaac tctgtctgtg tgctggagct 420 

aaaccaaagt tgatatgtga aggaaatcct tatgtattag gaatccgtga tacagacagt 480 

gctcaggaat ttcagaaaca gcttactaaa gctaaaagaa taatgatcat agggaacggt 540 

ggtattgcac ttgagttagt gtatgaaatt gaaggctgtg aagtgatttg ggccattaaa 600 

gataaagcta tagggaatac tttcttcgat gcaggagcag ctgaattctt gacttcaaag 660 

ctcattgctg aaaaatcaga ggctaaaatt gcacataaaa gaaccagata tacaactgaa 720 

ggaaggaaaa aggaagctag aagcaaatct aaagcagata atgtaggaag tgcattggga 780 

ccagattggc atgaaggctt gaatcttaaa ggaacaaaag agttttctca taagattcac 840 

cttgaaacta tgtgtgaagt aaagaaaatc taccttcagg atgagtttag aattttgaag 900 

aaaaagtcct tcacttttcc aagagaccat aagtcagtta cagctgatac agagatgtgg 960 

cctgtctatg tggaattgac caatgaaaag atatatggct gcgatttcat tgtcagtgct 1020 

acaggagtta caccaaatgt agaacctttt ctccatggta acagttttga tctaggagaa 1080 

gatggtggcc tgaaagtgga tgatcatatg cacacatccc ttcctgatat ctatgctgcc 1140 

ggtgacatct gtactacatc ctggcagctg agcccagtct ggcagcagat gaggctgtgg 1200 

acccaggcta gacagatggg atggtatgca gcaaagtgca tggctgcagc gagttcagga 1260 

gactctattg acatggattt cagctttgaa ctgtttgctc atgtgacaaa attttttaac 1320 

tataaggttg tactgctggg aaaatacaat gcacagggct taggttcaga tcatgaatta 1380 

atgctgagat gtaccaaagg acgagaatac atcaaagtcg tcatgcaaaa tggacgaatg 1440 

atgggagctg tcttaattgg tgaaaccgat ttagaagaaa catttgaaaa cctaatctta 1500 

aaccaaatga atctttcatc atatggagaa gatctgctag atccaaatat tgatatagaa 15 60 

gattattttg actaaaaatg gaatttcttc aggaatcata taaagttcca aatgacacca 1620 

gaaaaatcac aagtcaataa aatgaatgac tgtattgagt taatgatgac cacactgaaa 1680 

attacagaag tgataatgat attagtggaa aaatataaaa acataaattc taagtttgaa 1740 

atcagttcaa agtttattta tagatatatc tttccaatac aacactgacc gcttagataa 1800 

aaatcttaag ttatttattt ctgtgtttta aacataaata tgtttacttg tgatttagct 1860 

ttggagcaaa tttaggtaag ttatctactt agccaaatgt actctagtag actagaacca 1920 
ttcttt 1926 



<210> 42 
<211> 1727 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2754425CB1 
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<400> 42 

agggaccgac 

cttttgccgg 

tggcagtatc 

cccacgctaa 

ttgcttcaca 

agctggtcgc 

tgcgacacca 

tatcacgtgg 

gccgatgcca 

tgcacccaca 

cgctactcct 

ctcaacagcc 

acgccctatt 

cagaccatcc 

caaggatgga 

cagggggttt 

agccttcgga 

cctgggactg 

ggtctgaaaa 

gaagggcgct 

gcagaggatg 

gccgaacggc 

tggagtcggc 

accggtgttg 

atatcatgtc 

ttccttagat 

acagcgcatt 

tcttctgttg 

tctgaatata 



ggcacttgat 
aaagtggagc 
tgaggatcgg 
tctgcatggt 
cgcccagtaa 
tggatggcct 
ccgctgtcat 
gcagggagca 
acgggctgca 
acgagggcct 
ggtccggcat 
ggccctccat 
tctcccggga 
tgtgtgagtt 
aaatgaacag 
gtgttcccag 
cctgtattgg 
tgataacgga 
ctccaaatgg 
gccttctctt 
gcccacgggt 
aggctcttga 
gagaagggcc 
tttccatgct 
gggtccctct 
tttttttttt 
cctttgattg 
ctacttgttt 
ataatgtaaa 



aatgtgacaa 
agtgggcacg 
tggcagccat 
gtgggcgccc 
ggactccccc 
gagggactgc 
cactgtggcc 
gccccggccc 
gaatggctac 
caaccagaag 
gggccgcatc 
ccagaagccc 
cgcacagaaa 
tgagaccctc 
cacccccagc 
gaactgtagg 
gaacaatgtt 
gcactatgga 
ctgtgagctg 
tgatgactct 
ggttttcatg 
tttcatcttt 
gaggcggggc 
cagaaacctg 
ttcccttggt 
tccttccaat 
gtccttgagt 
ttttcagtgc 
acttcttgtg 



aaaccagcct 
gccaaccttg 
gcctccccct 
ttgggacccc 
aagatgtcgc 
atcgccaccg 
tgcctcctgg 
tacgtctccg 
gtgtactgcc 
ctgtaccaca 
cacaagggca 
gaggtcttct 
catgatgtgg 
tacaaagctt 
ggggagtggt 
aagtgcccac 
tttgggaacg 
cccaccaaca 
gtggtggggg 
ttcctgcatg 
gtggatttgt 
gctccgggac 
ctgggcagac 
cctcagcgga 
tattgtaaat 
catttgcttc 
gaccagagac 
tctgaaatag 
gtcatcttaa 



cttccacccc 
tcgttcatca 
gccccagccg 
cgaggactga 
tcgagtggct 
gcatccagtc 
tcctcttcgt 
tcaactccct 
agtcccctga 
acctgcagga 
tccgcgagca 
tcctgcccga 
aagtgctgga 
tctcaaactg 
tcacctttta 
ggacgtaccg 
cgtgcatctc 
tccgcatccg 
gagagcccca 
ctgcgttcca 
ggcatccaaa 
gatgagagta 
tgtggtccgg 
aagctcttat 
ggaaactttt 
agagactcct 
ttagtgccct 
agtaactaaa 
aaaaaaa 



actgcagtga 
atgccgcccc 
ctccttcccc 
ttgtctgacc 
ggtggcctgg 
cgtgcgggac 
gtggtactgt 
catgcaggct 
gtgcgtgcgc 
gtacgccaag 
gggccggtac 
cctgcccacc 
acggaacttc 
cagcctcccg 
cttggtcaat 
cttgctcgga 
tgtgctgagc 
atgccattta 
gtgctgggca 
tgaaggttca 
cgtcgcagcg 
tttcccatgc 
tccagtccct 
ttgggatttt 
cggcttgtat 
ttctggccta 
tgtaagtctg 
tggttatttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1727 



<210> 43 
<211> 611 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2821526CB1 

<400> 43 

cccatccggt gggttttgct cggcgcaccc gggaggcggg gccagcgagg caagatggag 60 

ttagtgcagg tcctgaaacg cgggctgcag cagatcaccg gccacggcgg tctccgaggc 120 

tatctacggg tttttttcag gacaaatgat gcgaaggttg gtacattagt gggggaagac 180 

aaatatggaa acaaatacta tgaagacaac aagcaatttt ttggccgtca ccgatgggtt 240 

gtatatacta ctgaaatgaa tggcaaaaac acattctggg atgtggatgg aagcatggtg 300 

cctcctgaat ggcatcgttg gcttcacagt atgactgatg atcctccaac aacaaaacca 3 60 

cttactgctc gtaaattcat ttggacgaac cataaattca acgtgactgg caccccagaa 42 0 

caatatgtac cttattctac cactagaaag aagattcagg agtggatccc accttcaaca 480 

ccttacaagt aaagacaatg aagaacagtt gaaacatgca aaatatggag cttttcatgt 540 

aattactctt ttactgttta ccattcacta taattcacaa ttaaaattgt gtgactaaac 600 

aaaaaaaaaa a 611 



<210> 44 
<211> 1352 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2876494CB1 
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<400> 44 

gttttattga caatacatgc atcatatctt 
aatcaagtta tgatttatag aggattcagc 
ggcaaaacgc ctaccgagac atgaaggttt 
ggttggcctt ctacctgcct ttggtggtga 
agctgcaaga agctgtgggg aaagttatca 
gccttggcta taaggaggag accgtctgtg 
gtcagactca gcgcttagaa tgtctgacca 
ttctcattgg caaggaattt gagcttagct 
aggaagcttt ccggttcacc tggagacttg 
tcttcaaacc ctttcaagcc aactcccact 
actctgggac atatcgctgt gatgtgcagc 
tctattttgg gttgagggtc cttcctccta 
ttactgagga tcagaagtta atagatgagg 
agcctcacca cccaaagtgg aaaaagaagg 
ttggagtggt tggtggcgtg ttggtgagga 
agcagtgaca gcttcaagaa cttaacagcc 
ccaagctagc tttttagggg agtgttccag 
actcccacag ccaaaagaat gagtgggaga 
cgcagtaacc taacttcctt atgtcccatg 
gtacccagga aactgcaagc attgcctgtg 
teat ttt eta ccctttatga ettggatgee 
gtgtattcat gtagagggat gtattcagee 
taattcacag tagctgtatg gagacctttt 



ttgactttga aggatatctc atgtcaaagg 60 
tggaatacct tgtgggtgct ggctgagggt 120 
tagecactag ttttgtcctt gggagcctgg 180 
ctacacctaa aacactggcc atccctgaga 240 
tcaatgccac aacctgtact gtcacctgtg 3 00 
aggtgggccc tgatggagtg agaaggaaat 360 
actggatctg tgggatgetc catttcacca 420 
gtctgagttc agacatcttg gagtttggac 480 
ctcgaggtgt catctccact gacgatgagg 540 
ttgtgaagtt taaatatget caggagtatg 600 
tggtaaaaaa cttgagactt gtcaagaggc 660 
acttggtgaa tctgaatttc catcagtcac 720 
gattggaagt taatctggac agctactcca 780 
tggegtcage cttgggaata ggaattgeca 840 
ttgtcctctg tgegctaagg gggggcctgc 9 00 
ttgctcctga agaactggct geccaggaag 960 
ctgctggtag tggatcagct tagagggaac 102 0 
aatggagggg acaatctcct gggagctatg 1080 
gatctcttcc tgatcttccc tgcccattgg 1140 
ttcctgggaa gagttctaag aagcttgeat 1200 
tccccacctc catttcccct cttctgagct 1260 
tttttagtga acattttttt tcaataaaag 1320 
tc 1352 



<210> 45 

<211> 1458 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3403225CB1 



<400> 45 

ggaeggagag gagcagagaa gcagcagaag 
tgagecagag ctggggttga agctggagca 
ggaacgatgc tggtcacctt gggactgetc 
gctccatcca tcaggaagtt ctttgctggt 
ggcaaggtag tggtgatcac tggcgccaac 
ctcgctagcc gaggageccg agtctatatt 
getgecagtg aaatccgagt ggatacaaag 
ctatccgaca ccaaatctat ccgagccttt 
ctccatattc tgatcaacaa tgcgggagta 
ggctttgaaa cccacctggg agtcaaccac 
cacccaggcg tcgtccgctc tgagctggtc 
cggctcttct ccccctttgt caagaeggea 
gccctggctg agggcctgga gcccctgagt 
tgggtgtctc caagggcccg aaataacaaa 
gage ttc tag gaatccggtg ggagtagctg 
tccatgccat aatgaacagg gaccaaggag 
ccagctggtg ctgcgaatcc tgcctgctct 
acacctgaca ctcttgtgag actggcttat 
cttctctaag acctggaaag tcagcaaccc 
ggctgggcat gggggtggca gaagagcccg 
agaggctcag ctgaggcaag aagagcacca 
caactcttgg ttgatctctt tctttttaaa 
aacacacaaa gatcctggct aaccctggcc 
tcccatttca ateatgeaga tggtttcttt 
ttagagaaaa taaagact 



cagecaagag ctggagccag accaggaacc 60 
gcagcaaaag caacagcagc tacagaagtt 120 
acctccttct tctcgttcct gtatatggta 180 
ggagtgtgta gaacaaatgt gcagcttcct 240 
aegggcattg gcaaggagac ggecagagag 300 
gectgeagag atgtactgaa gggggagtct 3 60 
aactcccagg tgctggtgcg gaaattggac 420 
gctgagggct ttctggcaga ggaaaagcag 480 
atgatgtgtc catattccaa gacagctgat 540 
ctgggcaccg gggtcaccac etaegcagtg 600 
cggcactcct ccctgctctg cctgctctgg 660 
egggaggggg cgcagaccag cctgcactgc 720 
ggcaagtact tcagtgactg caagaggacc 780 
acagctgagc gcctatggaa tgtcagctgt 840 
gtggaagagc tgcagcttta tcaggcccaa 900 
aaggccaacc ctaaaggatt gtcctcttgg 960 
gatcctcttg acccttctgg gaatgtttgc 1020 
ggcatgagtt gtggacacct atagagtgtt 1080 
tctgggggca gcaggactgg gcagatccca 1140 
agaaattggg tcagttccct catcagcacc 1200 
tcactgccta tttctagggg ctatacactc 1260 
aatatttgee accaccctgg agtctagacc 1320 
tatttagatt ccttcctctc acctggacct 1380 
ttgtaaagag ttccgtttgc ctttcaattt 1440 

1458 



<210> 46 

<211> 1884 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 4163943CB1 

<400> 46 

ggactccaag cgccatggcc gctgccgccc 
tgcaacgcgc agcgtgccag tgcccaactc 
tttcaccttc tgggaaaaca acagattatg 
atggagcagc agttacaaag gaagtaggaa 
tgtgcttgat gacagacaag aacctctcca 
ccctagtgaa gaatggcatc ccctttacgg 
attcaagctt catggaagct attgagtttg 
ctgtcggtgg tggctctacc atggacacct 
ctcattctga tttcctagat tatgtcagtg 
tgcctcttaa gcctctgatt gcagtgccaa 
gggttgccat ttttgactat gaacacttga 
tcaaacccac actgggactg attgatcctc 
tcgccaacag tggctttgat gtgctttgcc 
accacctgcg gagcccctgc ccttcaaatc 
acccaatcag tgacatttgg gctatccacg 
gggctgtcag aaatcccgat gatcttgaag 
ttgctggcat cggctttgga aatgctggtg 
tttcaggttt agtgaagatg tataaagcaa 
cccatggcct ttctgtggtg ctcacgtccc 
ttccagagcg acacctggag atggcagaaa 
tccaagatgc agggctggtg ttggcagaca 
ttgatgatgg cctagcagct gttggttact 
gaacgctgcc ccaggaaagg gtcaccaagc 
tggctgctct gtttgaagct tcaatgaaac 
taccgctggc cattgtagtg ctgagagcaa 
atctttgtgc ataacttacc tgttaccagt 
aattccccaa agctcagagt ccagttcctt 
tgttagaccc ccaggctcga cttcaggggt 
tactctgcct ctgcttcatg tagcaaatga 
attatatttc ctatgcctag taattcactt 
tagcaccagt acatacagaa gcatggcaag 
aaagatttga aacaaaaaaa aaaa 



gagcccgggt cgcgtacttg ctgaggcaac 60 
attctcatac ttactcccaa gcccctggac 120 
cctttgagat ggctgtttca aatattagat 180 
tggacctaaa aaacatgggt gctaaaaatg 240 
agctccctcc tgtgcaagta gctatggatt 300 
tttatgataa tgtgagagtg gaaccaacgg 3 60 
cccaaaaggg agcttttgat gcctatgttg 420 
gtaaggctgc taatctgtat gcatccagcc 480 
cccccattgg caagggaaag cctgtgtctg 540 
ctacctcagg aaccgggagt gaaactactg 600 
aagtaaaaat tggcatcact tcgagagcca 660 
tgcacaccct ccacatgcct gcccgagtgg 720 
atgccctgga gtcatacacc accctgccct 780 
ccatcacacg gcctgcgtac cagggcagca 840 
cgctgcggat cgtggctaag tatctgaaga 900 
caaggtctca tatgcacttg gcaagtgctt 960 
ttcatctgtg ccatggaatg tcttacccaa 1020 
aggattacaa tgtggatcac ccactggtgc 1080 
cagcggtgtt cactttcacg gcccagatgt 1140 
tactgggagc cgacacccgc actgccagga 1200 
cgctccggaa attcttattc gatctggatg 1260 
ccaaagctga tatccccgca ctagtgaaag 1320 
ttgcaccccg tccccagtca gaagaggatc 1380 
tgtattaatt gtcattttaa ctgaaagaat 1440 
gagctgatct agctagggct ttgtcttttc 1500 
ataggtggga tatacattta tcttgcagga 1560 
ccataaaaca ggctggacaa atgaccacta 1620 
cagtgttcct gtcccaaacc ccacacagaa 1680 
gcaaaaactc agtatctatc aaaagtgtaa 1740 
catgtctaaa aatttatctg atagaaacac 1800 
gatgtttctg gcagcacttt tctaataata 1860 

1884 



<210> 47 

<211> 1400 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 4293484CB1 



<400> 47 

ctccaggcca 

tcccgccaaa 

cagcaggtgg 

actctgcaca 

agcacgtggt 

accaacacct 

cttgtgtgca 

ctaacaccaa 

acaggcaccc 

ggaggaaggt 

acggtgaagg 

ggctcttccg 

cctggatgca 

aggtatactt 

tcctgggccc 

tggtgttcct 

acgcacccct 

ccgacaaggc 



gtctgggcac 
cgccagcctc 
cage cage cc 
ggcccagaag 
gcgaatgctg 
gggtccctgg 
gaeeggaaca 
aggtcacccc 
ctatccttgc 
ccgtgggggg 
ccaccagatc 
ggccatcccg 
cgtgctggca 
ctgc tacgat 
ctgcggactg 
ggctgccctc 
gctgaacccc 
tcagcgccat 



cc tgggatag 
gtcaccgctc 
ctggatgagc 
ctggtgtacc 
ctgcagcggg 
ctggaggagc 
cggttcctgg 
ttctacaggg 
agcaaggccc 
ctgcccctgg 
atgagggact 
gcctctgtgg 
gcccgggagc 
ggatcaccct 
cggctggtgg 
aatgccctgc 
tacaegctgg 
ttcggctatg 



cggctgcagc 
cagggcacct 
caaggtctct 
tggtcacagg 
agccccggct 
tgaagacagg 
tctacaccag 
gcaacgaaga 
tggccgagtg 
tgacgtgtgc 
tctaccgcca 
agcatggccg 
tggagcagcg 
acaggagcta 
gcgcccgccc 
tgcagtggct 
ccgtggccaa 
agcccctgtt 



tcccaggatc 
ccagcagtaa 
tccccagcca 
gggctgtggc 
eggggagctg 
tacccggaac 
cagcatggaa 
caccccatac 
gctggtcctg 
ccttcgtccc 
gggcctgcgc 
ggtctatgtg 
ggcaaccctg 
cgaggatttc 
attgctgccc 
gctgcggcca 
caccaccttc 
ctcgtgggag 



agctctgccc 
caggtggttg 
ggcatggccg 
ttcctgggag 
egggtctttg 
gtgatcgagg 
gttgtggggc 
gaagcagtgc 
gaggecaacg 
aegggcatet 
ctgggaggtt 
ggcaatgttg 
atgggcggcc 
aacatggagt 
tactggctgc 
ctggtgctct 
accgtcagca 
gatagcegga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 



35/40 



WO 01/44448 

cccgcaccat tctctgggta caggccgcta cgggttcagc ccagtgacgg 
gcctggaggc ccagatacag cacatccacc caggtcccga gccctcacac 
aagggacagc tgcattccag agcaggaggc agggctctgg ggccagaatg 
tcgtagagcc ctccacattt tctttttctt ttttgagaca gggtcttgct 
gactggagtg cagtggtgtg atcatagctc actgcaccct caacctcctg 
atcctcctgc ctcagcctcc 



PCTYUSOO/33158 

tggggctggg 1140 
cctggacggg 1200 
gctgtccttg 1260 
ctgtcaccca 1320 
ggttcaagca 13 80 
1400 



<210> 48 
<211> 1313 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4440080CB1 



<400> 48 

gaggctggct 

tcataggagc 

agatgtgacc 

cacagcggtg 

tcaggcagca 

tgcacacggg 

aacatcaata 

gccaccaagt 

cagacccaca 

aaactccacg 

aaacccgagg 

attggagaca 

cttccctccc 

gatcacggga 

tttttatatc 

gaggtggtgt 

ctttgtctgc 

atcttcttgc 

ttatatctgt 

tctccccctt 

ctggggtcat 

gtggatttca 



agggagccgt 
tggctgctga 
tatcaaatga 
tagacatctg 
ccagtggttg 
aagcctacca 
gcatgtctgg 
atgccgtcac 
tccgagccac 
acaaggaccc 
atgtggccga 
tccagatgag 
cacccttcat 
taccacttcc 
atcttgtcaa 
ccctaattgt 
tctcagtgtc 
acctcaacgt 
gttgttatcc 
atctatctcc 
ccctccactc 
tggtgatcat 



gggagtgagc 
atgtaagagt 
agaggacatc 
catcaacaat 
gaaggacatg 
gtccatgaag 
ccaccgagtg 
tgcgctgaca 
gtgcatctct 
tgagaaggca 
ggctgttatc 
gcccacggag 
ggcttgcctc 
tgtccacacc 
attgcttcag 
tttacttgtt 
ttccctttga 
ctgtggctca 
agggctccag 
ttctcggctc 
tgactctgac 
taaaaaagaa 



tgctcacccc 
gcaggctacc 
ctctccatgt 
gctggcttgg 
ttcaatgtga 
gagcggaatg 
ttacccctgt 
gagggactga 
ccaggtgtgg 
gctgccacct 
tacgtcctca 
caggtgacct 
ctgcctctgg 
ccgaccaggg 
ttgtaaatgt 
aacttgttct 
catgggaaag 
gggctggggt 
acttcctcct 
cccagcccag 
tatggcagca 
aaatcgcaac 



agtcagaccc 
ccgggacttt 
tctcagctat 
cccggcctga 
acgtgctggc 
tggacgatgg 
ctgtgaccca 
ggcaagagct 
tggagacaca 
atgagcaaat 
gcacccccgc 
agtgactgtg 
attttaggtg 
gctagaaaat 
gaaaaatggg 
tgtgcccctg 
gagttgtggc 
ggcagaggga 
ctgcctgccc 
tcttggcttc 
gaacaccagg 
caaaaaaaaa 



ctgttgggtt 
gatcccctac 
ccgttctcag 
caccctgctc 
cctcagcatc 
gcacatcatt 
cttctatagt 
tcgggaggcc 
attcgccttc 
gaagtgtctc 
acacatccag 
ggagctcctc 
ttgatttctg 
ttgtttgaga 
ctggggaaag 
ggcacttggc 
caaaatcccc 
ggccttcacc 
cactgcaccc 
ttgtcccctc 
gcctggccca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1313* 



<210> 49 

<211> 1459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



5495687CB1 



<400> 49 

gggccgcgcc 

tcgcgtagcc 

gaccgccccg 

ttcccgtccg 

ctgctcaccc 

cgatgggcca 

gcgatcacat 

agcctagcgg 

agagcctttt 

acacagagcc 

aaagtctgaa 

tcactctgaa 

gtgactgtac 

ctcactttaa 

ctcagcacag 



cgcgctccca 
gtgcgccgat 
cgtcatgcgg 
cggcgtggag 
tctccaggtg 
ggacagggca 
ggtgatgctg 
cgtcacctgt 
ctctctggat 
agaagtggcg 
atccccaaag 
aattttaaat 
tctagtcctg 
ctctctgccc 
cagcctttct 



ggctctcctc 
tgcctctcgg 
ctcctcggct 
gttgcagagg 
ggggctgtgt 
gcagaagagg 
tctgtgattc 
ggtgctggag 
ggcgctggag 
gaatctgacg 
gtgaactgtg 
atgtcacagg 
ttttacaccc 
cgggcatttc 
accaggtttg 



ccccagcctt 
cctgggcaat 
ggtggcaagt 
aaagtggtcg 
acctgggtga 
ccaatgcggt 
ctggggaagc 
gagcggagga 
cacacttccc 
cagccccgac 
aggagagaaa 
accttatgga 
cgtggtgccg 
cagctcttca 
gcaccgtagc 



cctccggctg 
ggtcccggct 
attgctgtgg 
cttatggtca 
ggaggagctc 
gctggggctg 
tgaggacaaa 
ctcaaggtgc 
tgacagagaa 
agaggactcc 
cattacagga 
ttttctgaac 
cttttctgcc 
ctttttggca 
tgttcctaat 



gcagcacgac 60 
gccggtcgac 120 
gtgctgggac 180 
gaggagcagc 240 
ctgcatgacc 300 
gacacccaag 3 60 
gtgagttcag 420 
aacgtccgag 480 
gaggagtatt 540 
aataacactg 600 
ttagaaaatt 660 
ccaaacggta 720 
agtttggccc 780 
ctggatgcat 840 
attttattat 900 
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PCT7US00/33158 



ttcaaggagc 
aaatcttcat 
ccgaccaaat 
tttccttatt 
ggtggctaat 
ggaaagagga 
ccagtgacgt 
aatgtataaa 
atattcaata 
gtaatccagt 



taaaccaatg 
ttttaatcag 
aggccctctt 
ctttttaatt 
tccaggacaa 
acttcaatcc 
gttgacttga 
aaaattataa 
gatgcactat 
ttgaaatgg 



gccagattta 
acaggtatag 
cccagcactt 
agttttatta 
gagcaggaac 
ttcgtttcag 
aacttcaggc 
actggtgttt 
tcttgttttt 



atcatacaga 
aagccaagaa 
tgataaaaag 
tgtatgctac 
atgtggagta 
aaattagtgc 
agattaaaag 
taactagtat 
actgcatgaa 



tcgaacactg 
gaatgtggtg 
tgtggactgg 
cattcgaact 
gtgatggtct 
tacagtttca 
aatcatttgt 
tgcaataagc 
cgtaattcag 



gaaacactga 
gtaactcaag 
ttgcttgtat 
gagagtattc 
gaaagaagtt 
tacattttct 
tgaacaactg 
aaatgcaaaa 
tatttggaaa 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1459 



<210> 50 

<211> 2101 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 552773 5CB1 

<400> 50 

cggctcgagg cggccgtcat ggcgtcgccc 
gatgacttca tcgggccgtc tcaggagtgc 
ggaagtggcg tggccaagat tcgcattgaa 
gacggcggga cccggaggct ggagaaggcc 
agcggctgca tcacctccgc agagaccgtg 
aagaaggttc tagatgctaa caagatggcg 
tcggtctcac cacagtctag agcatcgctg 
actgccagga aattaacctc attctttaaa 
gccttctcaa ggcacttcag cctcctggag 
ggacaggccg actgcagaca ggcactgccc 
tgctatgccg agaagactca cggcagcttc 
ccgcagcagg tcatgggctc cc tggtcaag 
cctgacaaga tctaccacgt cacagtgatg 
agacccgact ttttcaacca ggagcaccag 
ggagaagttt tcaggttgct ggaggaagag 
cctctggaca gcctgtgcag cggtgcctct 
ggctcggggg gctacctgga gcatgtgttc 
catgtggctg aggttaccta caaacccctg 
gagaaggagg gccaggtgct gctgcacttc 
aacctggtgc agaggctcaa acgagggcgc 
tgcccctcag gctgcctgaa cggcgggggc 
gage tec tec agcacgtgga gagactgtac 
gcgcctgggg ttcaggagct gtacacacac 
ggtcgcttgc tgeatacgea gtaccacgcc 
cggtggtagg ggctgcagga ccaggactcc 
accacatgcc ccaagacccc agggcttccc 
tgggacccga gtaggagcta ggactageca 
gggtgcctct gggttcccac tggctctgcc 
aggttccctg aggtcccaga gcctgttccg 
cccttgctgc tgctggtcgg gcacccaagt 
gectgaggea gagcaagacg ggttctcacc 
tgtgcaaaag gtgggagaca gagctggacc 
gtggaccctg gcaggggggg ggtgccaggc 
ctgtggtccg gggcaageca agtacgaagc 
gccaggtccc caccaggcct gtctcccagc 

g 



ttcagcgggg cgctgcagct gaeggacctg 60 
ateaagectg tcaaagtgga aaaaagggcg 120 
gatgaeggga gctacttcca aattaaccaa 180 
aaggtctege taaacgactg cc tggcgtgc 240 
cttatcaccc ageagageca cgaggagctg 3 00 
gcacccagtc agcagaggct ggttgtagtt 3 60 
getgeaeggt ttcagctgaa tcctacagat 420 
aaaatagggg tgcacttcgt cttcgacacc 480 
agecagegag agtttgtgcg gegattcega 540 
ctgctggcct ctgcctgccc aggctggatc 600 
atcctccccc acatcagcac cgcccggtcc 660 
gacttcttcg cccagcagca gcacttgacc 720 
ccctgctatg acaaaaagct ggaagcctcc 780 
acaegggatg tggactgtgt cctcacaaca 840 
ggcgtctccc tccccgacct ggaaccagcc 900 
gcagaggagc ccaccagcca teggggaggg 960 
cggcacgcgg cccgagagct ctttggaatc 1020 
aggaacaaag acttccagga ggtgacactg 1080- 
gcaatggcgt acggcttccg caacatccag 114 0 
tgcccctacc actacgtgga ggtcatggcc 1200 
cagctccagg ccccagacag gcccagcaga 1260 
ggcatggtcc gggctgaggc gcccgaggac 1320 
tggctgeagg gcacggactc ggagtgtgca 1380 
gtggagaagg ccagcactgg cctgggcatc 1440 
caggaggecg tgtccatgtg tgacagcaga 1500 
ccaaaattct gagtgagctg cagggtgtgc 1560 
ggacccgcag ccgcctcgtc acctccagtt 1620 
caggtggggt ggggtggccc aggcagcaga 1680 
ttggccctgg gccgaggccc acaggtgctg 1740 
gcgtgagggg cttcagcctg teceggggtt 1800 
cctgacttct ggaggcttcc cttgaagctc 1860 
tgcaggggtg gtcccgccac aaccctgcgt 1920 
ccctggaaag caggggttac cgttacgagg 1980 
agcagccatc gegggctgea tcatccccca 2040 
gtttgtctaa taaacgcacc ectegtaaaa 2100 

2101 



<210> 51 

<211> 2440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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PCT/US00/33158 



<223> Incyte ID No: 5540437CB1 
<400> 51 

gtgtccctgc ggcgctaaga aggggagact 
tgagcggctg cgggctcttc ctgcgcacca 
tggtctctac cgcgaaccgg cggctactgc 
aagagctttt cctaggcaaa atcaagaaga 
aagatgaact taatgaaatc aatcagttct 
aggtggactc ccgaaaaatt gaccaggaag 
tgaagagcct agggcttttt gggctgcaag 
ccaacaccat gtactcaaga ctaggggaga 
ccctggcagc gcaccaggct attggcctca 
agaaagccaa atacttgcct aaactggcgt 
cggagccagc cagtgggagc gatgcagcct 
acaagaagca ctacatcctc aatggctcca 
atatttttac tgtgtttgca aagactgagg 
aaatcacagc attcatagta gaaagagact 
ataaattagg cattcggggc tccaacactt 
ctgtggaaaa catccttgga gaggtcggag 
acagcggccg gttcagcatg ggcagcgtcg 
tgactgctga gtacgcctgc acaaggaaac 
tgattcagga gaaatttgca ctgatggctc 
acctcacagc agggatgctg gaccaacctg 
tggtgaaggt gttcagctcc gaggccgcct 
tcgggggctt gggctacaca agggactatc 
tcctcctcat cttcgaggga accaatgaga 
tgcagcatgc cggccgcatc ctgactacca 
gcacagtcat ggataccgtt ggccggaggc 
tggggctgac aggcaaccat ggagttgtgc 
ttgaggagaa cacctactgc ttcggccgga 
agaccatcat ggaggagcag ctggtactga 
atggcatgac ggccgtgctg tcgcgggcca 
acgaccacga ggttctcttg gccaacacct 
tcagcctctc tcagctggac aagtatgctc 
tgtcccagca gatccttgag aagcgagcct 
gctgaggcag gggacagtgt cccctgctac 
atgactgtta ctcttttttc agaaggtgtt 
ccgtctgcac ctgaagggtt gtcgcctggc 
caggcagtgc tctctaacag gaccatcaca 
gaattgctga cccctggaac tggcgggtat 
actggggctt atgctgctgc ctccagggtg 
gatagccatt tctgctcaac cacacattct 
tcattcattt aaactagaag cagaggcact 
gaatataaaa tgtcacaatc tgtgtactgt 



gaggctgagg ctggggaaca tcgggcagca 60 
cggctgcggc tcgtgcctgc cggggtctgg 120 
gcaccagccc gcctgtacga gctttcgcca 180 
aagaagtttt cccatttcca gaagttagcc 240 
tgggacccgt ggaaaaattc ttcactgaag 3 00 
ggaaaatccc agatgaaact ttggagaaat 3 60 
tcccagaaga atatggtggc ctgggcttct 420 
tcatcagcat ggatgggtcc atcactgtga 480 
aggggatcat cttggctggc actgaggagc 54 0 
ccggggagca cattgcagcc ttctgcctca 600 
caatccggag cagagccaca ctaagtgaag 660 
aggtctggat tactaatgga ggactggcca 720 
tcgttgattc tgatggatca gtgaaagaca 780 
ttggtggagt cactaatggg aaacccgaag 840 
gtgaagtcca ttttgaaaac accaagatac 900 
atgggtttaa ggtggccatg aacatcctca 9 60 
tggctgggct gctcaagaga ttgattgaaa 1020 
agtttaacaa gaggctcagt gaatttggat 1080 
agaaggctta cgtcatggag agtatgacct 1140 
gctttcccga ctgctccatc gaggcagcca 1200 
ggcagtgtgt gagtgaggcg ctgcagatcc 1260 
cgtacgagcg catactgcgt gacacccgca 1320 
ttctccggat gtacatcgcc ctgacgggtc 1380 
ggatccatga gcttaaacag gccaaagtga 1440 
ttcgggactc cctgggccga actgtggacc 1500 
accctagtct tgcggacagt gccaacaagt 1560 
ccgtggagac actgctgctc cgctttggca 1620 
agcgggtggc caacatcctc atcaacctgt 1680 
gccgctccat ccgcattggg ctccgcaacc 1740 
tctgcgtgga agcttacttg cagaatctct 1800 
cagaaaacct agatgagcag attaagaaag 1860 
atatctgtgc ccaccctctg gacaggacat 1920 
cgcccgcccc tacccatggc ccgttgctgg 1980 
gggattatca caggttaagc cttttgttcc 2040 
ctgggagagc ctcttccagg ttttgacctg 2100. 
gcttctgaac tgagccggag agagagaatg 2160: 
tctggtcatt gaggagacac catagtggaa 2220 
tgaggtgggt ggggacctgt gtcaggtgtg 2280 
ctaagaaaca gcttgaaagc tctgtctggg 2340 
taaaacatgt accaggaacc atttaacaaa 2400 
taaaaaaaaa 2440 



<210> 52 
<211> 1072 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5596281CB1 

<400> 52 

cggacggtgg gtccagacaa aggttgtctc 
acgccatcag ctcgctgctt aaaattaaac 
gaaagtcatc atcctgacgg ccgctgctca 
tgcaagagaa ggtgccaaag tcatagccac 
ggaaaagtac ccgggtattc aaactcgtgt 
tcagtttgcc agtgaagttg agagacttga 
tcatggaact gtcctggatt gtgaggagaa 
gcgcagcatg tacctgatga tcaaggcatt 
caatattatc aacatgtctt ctgtggcttc 
gtacagcaca accaaggcag ccgtgattgg 
ccagcagggc atcaggtgca actgtgtgtg 



aagtttgttg ctcaaaccga gttctggaga 60 
cacaggttcc attatgggtc gacttgatgg 120 
ggggattggc caagcagctg ccttagcttt 180 
agacattaat gagtccaaac ttcaggaact 240 
ccttgatgtc acaaagaaga aacaaattga 300 
tgttctcttt aatgttgctg gttttgtcca 360 
agactgggac ttctcgatga atctcaatgt 420 
ccttcctaaa atgcttgctc agaaatctgg 480 
cagcgtcaaa ggagttgtga acagatgtgt 540 
cctcacaaaa tctgtggctg cagatttcat 600 
cccaggaaca gttgatacgc catctctaca 660 
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agaaagaata 
aaagacggga 
tgatgaatct 
attttaggat 
tacgaagaaa 
ctttttaatg 
aatctcttct 



caagccagag 
agattcgcaa 
gcttatgtaa 
ctccatggtg 
actcaccaat 
atgtcactgt 
gaaatcattg 



gaaatcctga 
ctgcagaaga 
ctggtaaccc 
ggaaggaagg 
catctccttc 
ttgcaagagt 
taaagaaata 



agaggcacgg 
aatagccatg 
tgtcatcatt 
caggcccttc 
ctgttaatca 
ctgattcttt 
aaaatattga 



aatgatttcc 
ctctgcgtgt 
gatggaggct 
ctatccacag 
catgttaatg 
aagtatatta 
actcaaaaaa 



tgaagagaca 720 
atttggcttc 780 
ggagcttgtg 840 
tgaacctggt 900 
aaaataagct 960 
atctctttgt 1020 
aa 1072 



<210> 53 
<211> 1040 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



5731013CB1 



<400> 53 

ccttcggctc 

gcttaggctc 

cttggccagc 

cttcagcgtc 

ggtgatcgac 

ctgcgtgccc 

ctgcaagtcg 

ggacgtgatc 

gcaccccttc 

tcctgcccca 

ctttggactg 

atggaaccca 

gtttctttcc 

agtggcatcc 

tttaaatcca 

ctgaagaatc 

tgggctaggc 

ccaaggtcaa 



gagcggcaac 
cgcggcctcc 
gttctccaca 
agccgcgacg 
ttcgcccgac 
agagtagtgg 
gtcgaggaga 
cgcatccgca 
accaacaagc 
gcccaggtgc 
ttactccggt 
tcaatgggca 
agtgcctgct 
ctaaaacctg 
aattgataat 
cctgatgtct 
actttatatc 
aaaaaaaaaa 



cggaagcctc 
aagctgtagc 
acggactggg 
gcgcctcgtc 
ggaatccagg 
ccgaatacct 
tctcgacgct 
agcccttcca 
cgaccacgtt 
aagcacagtg 
aaaggtggtt 
agttgacaga 
tctgggggca 
agatgaggaa 
aatgatctca 
tattggaaca 
tgtgtgaata 



gaggtttagt 
tatgacggcg 
tcgctatgtg 
tcgcggcgcc 
ggtcgtaata 
taacggggct 
ggtgcagaag 
caccgacaac 
ccgcgggcta 
aagagttgcc 
cttccccttt 
ggttctgctt 
gtgaccttgt 
gacttcaagg 
aaacacagtg 
acc act gage 
cagatttata 



cccgcccccc 
cgcgggactc 
cagcagctgc 
agggagttcg 
tatgtaaact 
gtgegegagg 
ctggccgacc 
cctagcatcc 
cgcccccgag 
ccaccaactg 
gggattccaa 
gggataatga 
gaaccactca 
gttttacagg 
agaggtctga 
taeggagage 
aaacaggtta 



tctcctcgct 
cgagccgctt 
agegtctgag 
tggagcggga 
cgcgtccgtg 
agagcatcca 
agtegggett 
agggecagtg 
aggttcagga 
cagccccagg 
gcccaggcaa 
agagctgect 
tttttatgea 
acccttgttt 
aggctggctt 
tctgctgtga 
ataaacttat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1040 



<210> 54 
<211> 1624 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



5731162CB1 



<400> 54 

ctcagcctcg 

tatttttagt 

cgtgatccgc 

ggectgeagg 

tatcttgggc 

atggcggcgc 

tttcccggca 

ctgcctgagc 

attcttgtct 

aatggatgtt 

agcccagggc 

gaaaggcgt t 

aattggccac 

ggtgcagacc 

catctttaag 

gtacaeggag 

ggagaaegge 

ggtccgggcc 



cgagcagctg 
agagaegggg 
gcgcctctgc 
tgttgttctg 
ctgaggattg 
ctggactggg 
cctggaggcc 
ccctcgcagc 
gcggaagagt 
cctctccact 
agcacagact 
tttgatgaga 
gctctgcacg 
ttggccagaa 
caacctcact 
cccctgggcc 
tgtctctggt 
cctgttggct 



ggactacagg 
tttcaccgtg 
ctccaaagtg 
accccagctc 
aggectagag 
actcagccca 
gcgcttagaa 
tccagaagtt 
gtgtggccat 
gccgcacaga 
atttcttgag 
aaggaaattt 
cccacgaccc 
gtctgggcct 
ttggcggtga 
gggtgctggg 
tcatccctgg 
cagcgcctgg 



tgcccgccac 
ttagccagga 
ctgggattac 
cactgtgcca 
agaggcaggc 
getgettgge 
gccgcccagt 
ccaacaggat 
gcaacaaagg 
attctccacc 
cagtggtgac 
cctggtccct 
cgtcttcaag 
ccagatgccc 
agtctcccct 
cgtgtggatc 
ctcccacacc 
taccagcttc 



cacgcccggc 
tggtctcgat 
aggcccgagc 
gcccgacttg 
gttcctcaga 
ctgaccctct 
gccctgagcg 
ggattcctgg 
attggegaga 
caggaagagg 
aagattcgat 
ccggagaaat 
agcatcacac 
gtggtggtgc 
catcaggacg 
gcagtggagg 
agtggtgtgt 
cttgggtcag 



taattttttg 
ttcctgacct 
cgccgcaccc 
agatgecaag 
gtcacaggga 
cacagcataa 
tctccatggc 
tgctggaagg 
tagtggctga 
agcagcttcg 
tcttctttga 
ccatcaacaa 
actccttcaa 
agagcatgta 
cctccttcct 
atgccacgct 
caagaaggat 
agccagcccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



39/40 



WO 01/44448 



PCTYUS00/33158 



ggataacagc ctctttgtgc ccaccccagt 

agaagtggta cacaagagpa agcagaac^t 

ccacctcatg gaggcctctg gcaccacctg 

tgaactgccc tttccccaac tgtacaccta 

ctcccgggtg aagctgtggg ctgtaaacac 

agggaggtct tgtctcccct cctgggcttt 

gggtcagggg cttccctaag atcttcacct 

tgaggaggct tctcagccac caaagggttc 

atggaactct ggttattatg gtgttagtta 
aaaa 



gcagagaggg gccctggtcc tcatccatgg 1140 
ctctgaccgc tcgcgccagg cctacacttt 1200 
gagcccggag aactggctcc agccaacagc 1260 
aaggctctcg cagggcagga gccctcgccc 1320 
cagtgccttg ctcagcctcc tggttgcaac 1380 
cctcctgccc tgtgggcagc agcctaggct 1440 
ctctgcctcc ctactgcccc aacatagcct 1500 
tggccccttc tcactctcct ctcctctcag 1560 
tcgaataaaa acgacttcag aatgaaaaaa 1620 

1624 



40/40 



